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CROSS-REFERENCE 

This application: a)claims the benefit of U.S. Provisional Application No. 60/418,042, 
filed October 1 1, 2002, and U.S. Provisional Application No. 60/426,212, filed November 14, 
2002, and b) is a CIP of of PCX Application No. US02/24655, filed August 2, 2002, which 
application claims the benefit of U.S. Provisional Application No. 60/309841, filed August 3, 
2001, and U.S. Provisional ApplicationNo. 60/360061, filed February 25, 2002, and c) is a CIP 
of U.Sr Non-Provisional Application No. 10/080,273, filed February 19, 2002, which 
application claims the benefit of U.S. Provisional Application No. 60/269,523, filed February 
16, 2001, and d) is a CIP of U.S. Non-ProvisionalApplication No. 09/724,553,filed November 
28, 2000, and e) is a CIP of U.S. Non-Provisional Application No. 09/570,1 18, filed May 12, 
2000, which application claims the benefit of U.S. Provisional Application No. 60/134,114, 
filed May 14, 1999, all of which applications are incorporated herein by reference in their 
entirety. 

FIELD OF THE INVENTION 

The present invention relates to small molecules, peptides, peptide analogues, 
proteins, and methods for using such compositions to regulate signalling pathways in cells. In 
one aspect, the invention provides methods of modulating localization or function of receptors 
that bind heterotrimeric G proteins by antagonizing or promoting binding between a PDZ 
domain containing protein and a protein that binds a PDZ domain. 

BACKGROUND 

G-Protein-Coupled-Receptors (GPCRs) constitute the largest family of cell 
surface molecules involved in signal transmission. These receptors play key physiological roles 
and their dysfunction results in diseases or disorders such as immune and cardiovascular 
disorders including asthma and inflammation, neurological disorders including anxiety, memory, 
depression, and cognition, endocrine disorders and more. Their importance in physiological 
systems has made them one of the most-often targeted classes of proteins for drug discovery. 

Estimates of the number of different GPCRs in the human genome range from 



300 to over 1000. This makes drug discovery a complex process requiring significant trial and 
error in the identification of compounds that will inhibit a single GPCR. In an effort to reduce 
the complexity of this effort, many groups are attempting to develop drugs that inhibit 
interactions between hetero- or homo-dimerized GPCRs, regulatory features such as 
5 phosphorylation sites, or inhibition of specific G-protein binding to GPCRs. There still exists 
a great need to be able to effectively identify therapeutics that target this class of proteins. 

One class of GPCRs in need of additional therapeutic inhibitors is the alpha 
adrenergic receptors. Six alpha adrenergic receptors have been identified at this time: alpha 1 A, 
IB and IC and alpha 2 A, 2B and 2C. Alpha 1 receptors have been shown to mediate actions 

10 in the sympathetic nervous system through binding of hormones such as catecholamines, 
epinephrine and norepinephrine. Alpha 2 receptors have been shown to play roles in regulting 
neurotransmitter release froiri sympathetic and adrenergic neurons in the central nervous 
system. The tissue distributions differ between members of each group of receptors, arguing 
a need for type specific or sub-type specific therapeutics. Specific antagonists and agonists 

15 of certain alpha adrenergic receptors (aAR's) have been identified, but the pharmacokinetic 
profiles of certain alpha 1 adrenergeric receptors (al AR*s) demonstrate that they penetrate the 
blood brain barrier, potentially giving rise to adverse side effects (Pool JL. Int, Urol. Nephrol 
2001, 33(3):407). However, several indications merit therapeutic targeting of brain functions, 
so the need for blood brain barrier penetrance will be receptor type and disease specific. Alpha 
. 20 1 receptors have been experimentally implicated in depression, lower urinary tract storage, 
migraines, prostate apoptosis, and hypertrophy/proliferation/migration of vascular smooth 
muscle following carotid balloon injury. Alpha 2 receptors have been experimentally implicated 
in migraine, glucose metabolism, coronary flow reserve after stenting, Alzheimer^s, Parkinson's, 
neuroprotection, glaucoma,and opioid withdrawl management. We have demonstrated binding 

25 between alpha adrenergic receptors and PDZ proteins, thus allowing a novefset of targets to 
. treat the disorders listed above. 

We have identified that PDZ proteins can organize and regulate the expression 
and function of a subset of GPCRs. PDZ domain-containing proteins have since been shown 
to regulate a myriad of cellular functions firom vesicular trafficking, tumor suppression, protein 

30 sorting, establishment of membrane polarity, and apoptosis. A common function of this family 
is to facilitate the assembly of multi-protein complexes, often serving as a bridge between 
several proteins. By possessing multiple PDZ domains, many PDZ-containing proteins act 
as organizers within the cell by increasing the local concentration of one or more proteins, and 



by regulating the localization of the clusters through interactions with the cytoskeleton or other 
organelles. One such protein, EBP50 has been shown to be an essential link between the 32- 
adrenergic receptor and the actin cytoskeleton, regulating its proper endocytosis and recycling 
to the plasma membrane. Another protein containing multiple PDZ domains, PDZKl, is 
5 essential for regulating ion conductance and polarized membrane distribution of the cystic 
fibrosis chloride channel. Others contain intrinsic enzymatic activity, and use their PDZ 
domains to localize the enzyme with its appropriate substrates. Thus, PDZ domains represent 
an important means by which the cell regulates the organization, localization, and function of 
proteins. The function of PDZ domains in certain biological systems is described, for example, 

10 in published PCT applications that are commonly pwhed by the assignee of . the instant 
application (see, e.g., WO 00/13161, WO QO/69898 and WO 00/69897) , each of which is 
incorporated herein by reference in its entirety for all purposes). PDZ interactions with their 
ligands have been shown to be amenable to therpaeutic intervention (Aarts et al.. Science 2002, 
298:846), thus underscoring the therpaeutic potential for these interactions. 

15 The following publications are of interest: Stone (2003) 

Neuropsychopharmacology 28(8): 1387-99; Djavan (2003) Urology 62:6-14; Willems (2003) 
Cephalalgia 23(4):245-57;Anglin (2002) Prostate Cancer Prostatic Dis. 5(2):88-95;Pool(2001) 
Int Urol Nephrol 33(3):407- 12; Roehrborn (2002) 59:3-6; . Yelliquette (2003) J Pharmacol Exp 
Ther 306(2):646-57; Stewart (2002) Circulation 106(23):2946-54;Gregorini (2002) Cirulcation 

20 106(23):2901-7;Debeir (2002)Neurosci 115(l):41-53;Teeters(2003) Am J Physiol Heart Cu-c 
Physiol 284(l):H385-92; Savola (2003) Mov Disord 18(8):872-83; Wheeler (2003) Surv 
Ophthalmol 48supl:S47-51; Tatton (2003) Surv Ophthalmol 48supl:S25-37; Cowing (2003) 
Cochrane Database Sst Rev (2):CD002024; Pupo (2002) BMC Pharmacology 2:17-33. 

In addition, the following patents and patent applications are of interest: 

25 Soppet, 5,994,506; Pausch 6,406,871. 

SUMMARY 

Methods and compositions for modulating biological function in a variety of 
cell types (e.g., hematopoietic, neuronal, brain, stem, epidermal and epithelial) are provided 
herein. These methods and compositions can be utilized to treat various maladies such as 
30 immune disorders, nervous system disorders and muscle disorders, for example. More 
specifically, these methods and compositions are for modulating binding between certain 
PDZ proteins and PL protein binding pairs as shown in Table 8. 



Certain methods involve introducing into the cell an agent that alters binding 
between a PDZ protein and a PL protein in the cell, whereby the biological function is 
modulated in the cell, and wherein the PDZ protein and PL protein are a binding pair as 
specified in Table 2. In some of these methods, the agent is a polypeptide comprising at 
5 least the two or three carboxy -terminal residues of the PL protein. 

Screening methods to identify compounds that modulate binding between 
PDZ proteins and PL peptides or proteins are also provided. Some screening methods 
involve contacting under suitable binding conditions (i) a PDZ -domain polypeptide having 
a sequence from a PDZ protein, and (ii) a PL peptide, wherein the PL peptide comprises a 

10 C-terminal sequence of the PL protein, the PDZ -domain polypeptide and the PL peptide 
are a binding pair as specified in Table 2; and contacting is performed in the presence of the 
test compound. Presence or absence of complex is then detected. The presence of the 
complex at a level that is statistically significantly higher in the presence of the test 
compound than in the absence of test compound is an indication that the test compound is 

15 an agonist, whereas, the presence of the complex at a level that is statistically significantly 
lower in the presence of the test compound than in the absence of test compound is an 
indication that the test compound is an antagonist. 

Modulators of binding between a PDZ protein and a PL protein are also 
described herein. In certain instances, the modulator is (a) a peptide comprising at least 3 

20 residues of a C-terminal sequence of a PL protein, and wherein the PDZ protein and the PL 
protein are a binding pair as specified in Table 2; or (b) a peptide mimetic of the peptide of 
section (a); or (c) a small molecule having similar functional activity with respect to the 
PDZ and PL protein binding pair as the peptide of section (a). The modulator can be either 
an agonist or antagonist. Such modulators can be formulated as a pharmaceutical 

25 composition. 

Methods of treating a disease correlated with binding between a PDZ protein 
and a PL protein are also disclosed herein, the method comprising administering a 
therapeutically effective amount of a modulator as provided herein, wherein the PDZ 
protein and the PL protein are a binding pair as specified in Table 2. As indicated supra, 



such methods can be used to treat a variety of diseases such as neurological disease, an 
immune response disease, a muscular disease, or a cancer. 

BRIEF DESCRIPTION OF THE DRAWINGS 
5 FIGURE 1 shows the interactionbetween Interleukin 8 receptor A (IL8RA) and 

the PDZ proteins MAGIl (domain 2 of 6), TIPl (doiriain 1 of 1) and MINT2 (domains 1 & 

2) in the in vitro ''G" Assay. For each of the three PDZ proteins, the OD (A450) of the 
interaction with IL8RA is shown in dark gray. The negative control for each of these three 
reactions is the interaction of GST with IL8RA'peptide, the results of which are shown in light 
10 gray. 

DESCRIPTION 

1. Definitions 

A "fusion protein" or "fusion polypeptide" as used herein refers to a composite 
1 5 protein, i.e., a single contiguous amino acid sequience, made up of two (or more) distinct, 
heterologous polypeptides which are not normally fused together in a single amino acid 
sequence. Thus, a fusion protein can include a single amino acid sequence that contains two 
entirely distinct amino acid sequences or two similar or identical polypeptide sequences, 
provided that these sequences are not normally found together in the same configuration in a 
20 single amino acid sequence found in nature. Fusion proteins can generally be prepared using 
either recombinant nucleic acid methods, i.e., as a result of transcription and translation of a 
recombinant gene fusion product, which fusion comprises a segment encoding a polypeptide 
of the invention and a segment encoding a heterologous protein, or by chemical synthesis 
methods well known hi the art. 

25 

A "fusion protein construct" as used herein is a polynucleotide encoding a 

fusion protein. 
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As used herein, the term "PDZ domain" refers to protein sequence (i.e., 
modular protein domain) of approximately 90 amino acids, characterized by homology to the 
brain synaptic protein PSD-95, the Drosdphila septate junction protein Discs-Large (DLG), 
and the epithelial tight junction protein ZOl (ZOl). PDZ domains are also known as Discs- 
5 Large homology repeats ("DHRs") and GLGF repeats. PDZ domains generally appear to 
maintain a core consensus sequence (Doyle, D. A., 1996, Cell 85: 1067-76). 

PDZ domains are found in diverse membrane-associated proteins including 
members of the MAGUK family of guanylate kinase homologs, several protein phosphatases 
and kinases, neuronal nitric oxide synthase, and several dystrophin-associated proteins, 
10 collectively known as syntrophins." 

Exemplary PDZ domain-containing proteins and PDZ domain sequences are 
shown in TABLE 6. The term "PDZ domain" also encompasses variants (e.g., naturally 
occurring variants) of the sequences of TABLE 6 (e.g., polymorphic variants, variants with 
conservative substitutions, and the like). Typically, PDZ domains are substantially identical 
1 5 to those shown in TABLE 6, e.g., at least about 70%, at least about 80%, or at least about 90% 
amino acid residue identity when compared and aligned for maximum correspondence. 

, As used herein, the term "PDZ protein" refers to a naturally occurring protein 
containing a PDZ domain. Exemplary PDZ proteins include CASK, MPPl, DLGl, PSD95, 
20 NeDLG, TIP33, SYNla, TIP43, LDP, LIM, LIMKl, LIMK2, MPP2, NOSl, AF6, PTN-4, 
prIL16, 41.8kD, KIAA0559,. RGS12, KIAA0316, DVLl, TIP40, TIAMl, MINTl, 
KIAA0303, CBP, MINT3, TIP2, KIAA0561, and those listed in TABLE 6. 

As used herein, the terma"PDZ-domain polypeptide" or "PDZ polypeptide" 
25 refer to a polypeptide containing a PDZ domain, such as a fusion protein including a PDZ 
domain sequence, a naturally occurring PDZ protein, or an isolated PDZ domain peptide. 

As used herein, the term "G-protein coupled receptor," or "GPCR," refers to 
a naturally occuring polypeptide receptor, or a polynucleotide encoding a receptor, known to 

6 



interact with G-proteins or have homology to proteins known to interact with G proteins. In 
addition, this definition includes polypeptide receptors, or polynucleotides encoding receptors, 
that are similar to those known to interact. with G-proteins. A partial list of known GPCR's 
is presented in TABLE 3. 

5 . " ^ 

As used herein, the term "PL protein" or "PDZ Ligand protein" refers to a 
naturally occurring protein that forms a molecular complex with a PDZ-domain, or to a protein 
whose carboxy-terminus, when expressed separately from the fiill length protein (e.g., as a 
peptide fragment of 4-25 residues, e.g., 8, 10, 12, 14 or 16 residues), forms such a molecular 
10 complex. The molecular complex can be observed in vitro using the "A assay" or "G assay" 
described infra, or in vivo. Exemplary PL proteins listed in TABLE 2 are demonstrated to. bind 
specific PDZ proteins. This definition is not intended to include anti-PDZ antibodies and the 
like. 

As used herein, GPCR-PL refers to a PDZ ligand sequence that occurs within 
15 a GPCR polypeptide sequence. 

As used herein, a "PL sequence" refers to the amino acid sequence of the C- 
terminusof aPLprotein (e.g., the C-terminal 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 14, 16, 20 or 25 
residues) ("C-terminal PL sequence") or to an intemal sequence known to bind a PDZ domain 

20 ("intemal PL sequence). " 

As used herein, a "PL peptide" is a peptide of having a sequence from, or based 
on, the sequence of the C-terminus of a PL protein. Exemplary PL peptides (biotiriylated) are 
listed in TABLE 2. 

25 * 

As used herein, a "PL fusion protein" is a fusion protein that has a PL sequence 
as one domain, typically as the C-terminal domain of the fiision protein. An exemplary PL 
fusion protein is a tat-PL sequence fusion. 
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As used herein, the term "PL inhibitor peptide sequence" refers to PL peptide 
amino acid sequence that (in the form of a peptide or PL fusion protein) inhibits the interaction 
between a PDZ domain polypeptide and a PL peptide (e.g., in an A assay or a G assay). 

As used herein, a "PDZ-domain encoding sequence" means a segment of a 
polynucleotide encoding a PDZ domain. In various embodiments, the polynucleotide is DNA, 
RNA, single stranded or double stranded. 

As used herein, the terms "antagonist" and "inhibitor,"when used in the context 
of modulating a binding interaction (such as the binding of a PDZ domain sequence to a PL 
sequence), are used interchangeably and refer to an agent that reduces the binding of the, e.g., 
PL sequence (e.g., PL peptide) and the, e.g., PDZ domain sequence (e.g., PDZ protein, PDZ 
domain peptide). 

As used herein, the terms "agonist" and "enhancer," when used in the context 
of modulating a binding interaction (such as the binding of a PDZ domain sequence to a PL 
sequence), are used interchangeably and refer to an agent that increases the binding of the, e.g., 
PL sequence (e.g., PL peptide) and the, e.g., PDZ domain sequence (e.g., PDZ protein, PDZ 
domain peptide). 

As used herein, the terms "peptide mimetic, " "peptidomimetiq" and "peptide 
analog" are used interchangeably and refer to a synthetic chemical compound which has 
substantially the same structural and/or functional characteristics of an PL inhibitory or PL 
binding peptide of the invention. The mimetic can be either entirely composed of synthetic, 
non-natural analogues of amino acids, or, is a chimeric molecule of partly natural peptide amino 
acids and partly non-natural analogs of amino acids. The mimetic can also incorporate any 
amount of natural amino acid conservative substitutions as long as such substitutions also do 
not substantially alter the mimetic 's structure and/or inhibitory or binding activity. As with 
polypeptides of the invention which are conservative variants, routine experimentation will 

8 



determine whether a mimetic is within the scope of the invention, i.e., that its structure and/or 
function is not substantially altered. Thus, a mimetic composition is within the scope of the 
invention if it is capable of binding to a PDZ domain and/or inhibiting a PL-PDZ interaction. 

Polypeptide mimetic compositions can contain any combination of nonnatural 
structural components, which are typically from three structural groups: a) residue linkage 
groups other than the natural amide bond ("peptide bond") linkages; b) non-natural residues in 
place of naturally occurring amino acid residues; or c) residues which induce secondary , 
structural mimicry, i.e., to induce or stabilize a secondary structure, e.g., a beta turn, gamma 
turn, beta sheet, alpha helix conformation, and the like. 

A polypeptide can be characterized as a mimetic when ail or some of its 
residues are joined by chemical means other than natural peptide bonds. Individual 
peptidomimetic residues can be joined by peptide bonds, other chemical bonds or coupling 
means, such as, e.g., glutaraldehyde, N-hydroxysuccinimide esters, bifimctional maleimides, 
N,N=-dicyclohexylc£irbodiimide (DCC) or N,N=-diisopropylcarbodiiimide (DIC). Linking 
groups that can be an alternative to the traditional, amide bond ("peptide bond") linkages 
include, e.g., ketomethylene(e.g., -C(=0)-GH2- for -C(=0)-NH-), aminomethylene (CH2-NH), 
ethylene, olefin (CH=CH), ether (CH2-O), thioether (CH^-S), tetrazole (CN4-), thiazole, 
retroamide, thioamide, or ester (see, e.g., Spatola (1983) in Chemistry and Biochemistry of 
Amino Acids, Peptides and Proteins, Vol. 7, pp 267-357, A Peptide Backbone Modifications, 
Marcell Dekker, NY). 

A polypeptide can also be characterized as a mimetic by containing all or some 
non-natural residues in place of naturally occurring amino acid residues. Nonnatural residues 
are well described in the scientific and patent literature; a few exemplary nonnatural 
compositions useful as mimetics of natural amino acid residues and guidelines are described 
below. 

Mimetics of eiromatic amino acids can be generated by replacing by, e.g., D- or 
L- naphylalanine; D- or L-.phehylglycine; D- or L-2 thieneylalanine; D- or L-1, -2, 3-, or 4- 
pyreneylalanine; D- or L-3 thieneylalanine; D- or L-(2-pyridinyl)-alanine; D- or L-(3- 
pyridinyl)-alanine; D- or L-(2-pyrazinyl)-alanine; D- or L-(4-isopropyl)-phenylglycine; D- 



(trifluoromethyl)-phenylglycine;D-(trifluoromethyl)-phenylalanineP-p- 

D- or L"p-biphenylphenylalanine; K- or L-p-methoxybiphenylphenylal£inine; D- or L-2- 

indole(alkyl)alanines;and, D- orL-alkylainines,wherealkylcanbe substituted or unsubstituted 

methyl, ethyl, propyl, hexyl, butyl, pentyl, isopropyl, iso-butyl, sec-isotyl, iso-pentyl, or a 

non-acidic amino acids. Aromatic rings of a nonnatural amino acid include, e.g., thiazolyl, 

thiophenyl, pyrazolyl, benzimidazoiyl, naphthyi, furanyi, pyrrolyi, and pyridyl aromatic 

rings. 

Mimetics of acidic amino acids can be generated by substitution by, e.g., non^ 
carboxylate amino acids while maintaining a negative charge; (phosph9no)alanine; sulfated 
threonine. Carboxyl side groups (e.g., aspartyl or glutamyl) can also be selectively modified 
by reaction with carbodiimides (R=-N-C-N-R=) such as, e.g., l-cyciohexyl-3(2-morpholinyl- 
(4-ethyl) carbodiimide or l-ethyl-3(4-£izonia-4,4-dimetholpentyl) carbodiimide. Aspeirtyl or 
glutamyl can also be converted to asparaginyl and glutaminyl residues by reaction with 
ammonium ions. 

Mimetics of basic amino acids can be generated by substitution with, e.g., (in 
addition to lysine and arginine) the amino acids ornithine, citruUine, or (guanidino)-acetic acid, 
or (guanidino)alkyl-aceticacid, where alkyl is defined above. Nitrile derivative (e.g., containing 
the CN-moietyinplaceof COOH)canbe substituted for asparagine or glutamine. Asparaginyl 
and glutaminyl residues can be deaminatedto the corresponding aspartyl or glutamyl residues. 

Arginine residue mimetics can be generated by reacting arginyl with, e.g., one or 
more conventional reagents, including, e.g., phenylglyoxal, 2,3-butanedione, 1,2- 
cyclohexanedione, or ninhydrin, preferably under alkaline conditions. 

Tyrosine residue mimetics can be generated by reacting tyrosyl with, e.g., 
aromatic diazonium compounds or tetranitromethane. N-acetylimidizol and tetranitromethane 
can be used to form O-acetyl tyrosyl species and 3-nitro derivatives, respectively. 

Cysteine residue mimetics can be generated by reacting cy steiny 1 residues with, 
e.g., alpha-haloacetates such as 2-chloroacetic acid or chloroacetamide and corresponding 
amines; to give carboxymethyl or carboxyamidomethyl derivatives. Cysteine residue mimetics 
can also be generated by reacting cysteinyl residues with, e.g., bromo-trifluoroacetone, alpha- 
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bromo-beta-(5-imidozoyl)propionic acid; chloroacetyl phosphate, N-alkylmaleimides,3-nitro- 
2-pyridyl disulfide; methyl 2-pyridyl disulfide; p-chloromercuribenzoate; 2-chloromercuri-4 
nitrophenol; or, chloro-7-nitrobenzo-oxa-l,3-diazole, 

Lysine mimetics can be generated (and amino terminal residues can be altered) 
by reacting lysinyl with, e.g., succinic or other carboxylic acid anhydrides. Lysine and other 
alpha-amino-containing residue mimetics can also be generated by reaction with imidoesters, 
such as methyl picolinimidate, pyridoxal phosphate, pyridoxal, chloroborohydride, 
trinitrobenzenesulfonic acid, O-methylisourea, 2,4, pentanedione, and transamidase-catalyzed 
reactions with glyoxylate. . 

Mimetics of methionine can be generated by reaction with, e.g., methionine 
sulfoxide. Mimetics of proline include, e.g., pipecolic acid, thiazolidine carboxylic acid, 3- or 
4- hydroxy proline, dehydroproline,3- or 4-methylproline,or 3,3,-dimethylproline. Histidine 
residue mimetics can be generated by reacting histidyl with, e.g., diethylprocarbonate or para- 
bromophenacyl bromide. ' 

Other mimetics include, e.g., those generated by hydroxylation of proline and 
lysine; phosphorylation of the hydroxyl groups of seryl or threonyl residues; methylation of 
the alpha-amino groups of lysine, arginine and histidine; acetylation of the N-terminal amine; 
methylation of main chain amide residues or substitution with N-methyl amino acids; or 
amidation of C-terminal carboxyl groups. 

A component of a naturar polypeptide (e.g., a PL polypeptide or PDZ 
polypeptide) can also be replaced by an amino acid (or peptidomimetic residue) of the opposite 
chirality. Thus, any amino acid naturally occurring in the L-configuration (which can also be 
referred to as the R or S, depending upon the structure of the chemical entity) can be replaced 
with the amino acid of the same chemical structural type or a peptidomimetic, but of the 
opposite chirality, generally referred to as the D- amino acid, but which can additionally be 
referred to as the R- or S- form. 

The mimetics of the invention can also include compositions that contain a 
structural mimetic residue, particularly a residue that induces or mimics secondary structures, 
such as a beta turn, beta sheet, alpha helix structures, gamma turns, and the like. For example, 

11 



substitution of natural amino acid residues with D-amino acids; N-alpha^methyl amino acids; 
C-alpha-methyl amino acids; or dehydroamino acids within a peptide can induce or stabilize 
beta turns, gamma turns, beta sheets or alpha helix conformations, Beta turn mimetic structures 
have been described, e.g., by Nagai (1985) TetLett. 26:647-650; Feigl (1986) J. Amer. Chem. 
5 Soc. 108:181-182; Kahn(1988) J. Amer. Chem. Soc. 110:1638-1639; Kemp(1988) Tet. Lett. 
29:5057-5060; Kahn (1988) J. Molec. Recognition 1:75-79. Beta sheet mimetic structures have 
been described, e.g., by Smith (1992) J. Amer. Chem. Soc. 1 14:10672-10674. For example, a 
type VI beta tum iriducedby a cis amide surrogate, 1,5-disubstituted tetrazol, is described by 
Beusen (1995) Biopolymers 36: 1 8 1-200. Incorporation of achiral omega-amino acid residues 

10 to generate polymethylene units as a substitution for amide bonds is described by Banerjee 
(1996) Biopolymers 39:769-777. Secondary structures of polypeptides can be analyzed by, 
e.g., high-field lHNMRor2DNMR spectroscopy, see, e.g., Higgins (1997) J. Pept. Res, 
50:421-435. See also, Hruby (1997) Biopolymers 43:219-266, Balaji, et al., U.S. Pat No. 
5,612,895. 

15 . 

As used herein, "peptide variants" and "conservative amino acid substitutions" 
refer to peptides that differ from a reference peptide (e.g., a peptide having the sequence of the 
carboxy-terminus of a specified PL protein) by substitution of an amino acid residue having 
similar properties (based on size, polarity, hydrophobicity , and the like). Thus, insofar as the 

20 compounds that are encompassed within the scope of the invention are partially defined in 
terms of amino acid residues of designated classes, the amino acids may be generally categorized 
into three main classes: hydrophilic amino acids, hydrophobic amino acids and cysteine-like 
amino acids, depending pririiarily on the characteristics of the amino acid side chain. These 
main classes may be further divided into subclasses. Hydrophilic amino acids include amino 

25 acids having acidic, basic or polar side chains and hydrophobic amino acids include amino acids 
having aromatic or apolar side chains. Apolar amino acids may be furthersubdividedto include, 
among others, aliphatic amino acids. The definitions of the classes of amino acids as used 
herein are as follows: 

" Hvdrophobic Amino Acid" refers to an amino acid having a side chain that is 
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uncharged at physiological pH and that is repelled by aqueous solution. Examples of 
genetically encoded hydrophobic amino acids include He, Leu and Val. Examples of non- 
genetically encoded hydrophobic amino acids include t-BuA. 

" Aromatic Amino Acid " refers to a hydrophobic amino acid having aside chain 
containing at least one ring having a conjugated 7i-electron system (aromatic group). The 
aromatic group may be further substituted with groups such as alkyl, alkenyl, alkynyl, 
hydroxyi, sulfanyl, nitro and amino groups, as well as others. Examples of genetically encoded 
aromatic amino acids include Phe, Tyr and Trp! Commonly encountered non-genetically 
encoded aromatic amino acids include phenylglycine, 2-naphthylalanine, P-2-thienylalanine, 
1 ,2,3,4-tetrahydroisoquinoline-3-carboxylic acid, 4-chloro-phenylalanine, 2-fluorophenyl- 
alanine, 3-fluorophenylalanine and 4-fluorophenylalanine. 

" Apolar Amino Acid " refers to a hydrophobic amino acid having a side chain 
that is generally uncharged at physiological pH and that is not polar. Examples of genetically, 
encoded apolar amino acids include Gly , Pro and Met. Examples of non-encoded apolar amino 
acids include Cha. 

" Aliphatic Amino Acid " refers to an apolar amino acid having a saturated or 
unsaturated straight chain, branched or cyclic hydrocarbon side chain. Examples of genetically 
encoded aliphatic amino acids include Ala, Leu, Val and He. Examples of non-encoded aliphatic 
amino acids include Nle. 

" Hydrophilic Amino Acid" refers to an amino acid having a side chain that is 
attracted by aqueous solution. Examples of genetically encoded hydrophilic amino acids 
include Ser and Lys. Examples of non-encoded hydrophilic amino acids include Cit and hCys. 

" Acidic Amino Acid " refers to a hydrophilic amino acid having a side chain pK 
value of less than 7. Acidic amino acids typically have negatively charged side chains at 
physiological pH due to loss of a hydrogen ion. Examples of genetically encoded acidic amino 
acids include Asp and Glu. 

" Basic Amino Acid " refers to a hydrophilic amino acid having a side chain pK 
value of greater than 7. Basic amino acids typically have positively charged side chains at 
physiological pH due to association with hydroriium ion. Examples of genetically encoded 
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basic amino acids include Arg, Lys and His. Examples of non-genetically encoded basic amino 
acids include the non-cyclic amino acids ornithine, 2,3-diaminopropionic acid, 2,4- 
diaminobutyric acid and homoarginine. 

" Polar Amino Acid " refers to a hydrophilic amino acid having a side chain that 
is uncharged at physiological pH, but which has a bond in which the pair of electrons shared 
in common by two atoms is held more closely by one of the atoms. Examples of genetically 
encoded polar amino acids include Asx and Glx. Examples of non-genetically encoded polar 
amino acids include citruUine, N-acetyl lysine and methionine sulfoxide. 

" Cysteine-Like Amino Acid " refers to an amino acid having a side chain capable 
of forming a covalent Imkage with a side chain of another amino acid residue, such as a disulfide 
linkage. Typically, cysteine-like amino acids generally have a side chain containing at least one 
thiol (SH) group. Examples of genetically encoded cysteine-like amino acids include Cys. 
Examples of non-genetically encoded cysteine-like amino acids include homocysteine and 
penicillamine. 

As will be appreciated by those having skill in the art, the above classification 
are not absolute — several amino acids exhibit more than one characteristic property, and can 
therefore be included in more than one category. For example, tyrosine has both an aromatic 
ring and a polar hydroxy 1 group. Thus, tyrosine has dual properties and can be included in both 
the aromatic and polar categories. Similarly, in addition to being able to form disulfide linkages, 
cysteine also has apolar character. Thus, while not strictly classified as a hydrophobic or 
apolar amino acid, in many instances cysteine can be used to confer hydrophobicity to a 
peptide. 

Certain commonly encountered amino acids which are not genetically encoded 
of which the peptides and peptide analogues of the invention may be composed include, but 
are not limited to, P-alanine(b-Ala) and other omega-amino acids such as 3-aminopropionicacid 
(Dap), 2,3-diaminopropionicacid (Dpr), 4-aminobutyric acid and so forth; a-aminoisobutyric 
acid (Aib); e-amihohexanoic acid (Aha); 8-aminovaleric acid (Ava); N-methylglycine or 
safcosine (MeGly); ornithine (Om); cifrulline (Cit); t-butylalanine (t-BuA); t-butylglycine 
(t-BuG); N-methylisoleucine(Melle); phenylglycine(Phg); cyclohexylalanine(Cha); norleucine 
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(Nle); 2-naphthylalanine (2-Nal); 4-chlorophenylalanine (Phe(4-CI)); 2-fluorophenylalanine 
(Phe(2-F)); 3-fluorophenylaIanine (Phe(3-F)); 4-fluorophenylalanine(Phe(4-F)); penicillamine 
(Pen); 1, 2,3, 4-tetrahydroisoquinoline-3-carboxy lie acid. (Tic); P-2-thienylalanine . (Thi); 
methionine sulfoxide (MSO); homoarginine (hArg); N-acetyl lysine (AcLys); 2,3- 
5 diaminobutyricacid (Dab); 2,3-diaminobutyricacid (Dbu);p-aminophenylalanine(Phe(pNH)); 
N-methyl valine (MeVal); homocysteine (hCys) and homoserine (hSer). These amino acids 
also fall conveniently into the categories defined above. 

The classifications of the above-described genetically encoded and non-encoded 
amino acids are summarized in TABLE 1, below. It is to be understood that TABLE 1 is for 

10 illustrative purposes only and does not purport to be an exhaustive list of amino acid residues 
which may comprise the peptides and peptide analogues described herein. Other amino acid 
residues which are useful for making the peptides and peptide analogues described herein can 
be found, e.g., in Fasman, 1989, CRC Practical Handbook of Biochemistry and Molecular 
Biology, CRC Press, Inc., and the references cited therein. Amino acids not specifically. 

1 5 mentioned herein can be conveniently classified into the above-described categories on the basis 
of known behavior and/or their characteristic chemical and/or physical properties as compared 
with amino acids specifically identified. ' 



TABLE 1 



Classification 


Genetically Encoded 


Genetically Non-Encoded 


Hydrophobic 






Aromatic 

Apolar 


F, Y, W 

M, G, P 


Phg, Nal, Thi, tic, Phe(4-Cl), Phe(2-F), 
Phe(3-F), Phe(4-F), Pyridyl Ala, 
Benzothienyl Ala 


Aliphatic 


A, V, L, I 


. t-BuA, t-BuG, Melle, Nle, MeVal, Cha, 
bAla, MeGly, Aib 


Hydrophilic 






Acidic 


D, E 




Basic 


H, K, R 


Dpr, Om, hArg, PheCp-NHz), DBU, A2BU 


Polar 


Q, N, S, T, Y 


Cit, AcLys, MSO, hSer 


Cysteine-Like 


C 


Pen, hCys, p-methyl Cys 
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As used herein, a "detectable label" has the ordinary meaning in the art and 
refers to an atom (e.g., radionuclide), molecule (e.g., fluorescein), or complex, that is or can be 
used to detect (e.g., due to a physical or chemical property), indicate the presence of a molecule 
5 or to enable binding of another molecule to which it is covalently bound or otherwise 
associated. The term "label" also refers to covalently bound or otherwise associated molecules 
(e.g., a biomolecule such as an enzyme) that act on a substrate to produce a detectable atom, 
molecule or complex. Detectable labels suitable for use in the present invention include any 
composition detectable by spectroscopic, photochemical, biochemical, immunochemical, 

10 electrical, optical or chemical means. Labels useful in the present invention include biotin for 
staining with labeled streptavidin conjugate, magnetic beads (e.g., DynabeadsTM), fluorescent 
dyes (e.g., fluorescein, Texas red, rhodamine, green fluorescent protein, enliemced green 
fluorescent protein, and the like), radiolabels (e.g., ^H, ^^^I, ^^S, '"^C, or ^^P), enzymes ( e.g., 
hydrolases, particularly phosphatases such as alkaline phosphatase, esterases and glycosidases, 

15 or oxidoreductases, particularly peroxidases such as horse radish peroxidase., and others v 
commonly used in ELISAs), substrates, cofactors^ inhibitors, chemiluminescent groups, 
chromogenic agents, and colorimetric labels such as colloidal gold or colored glass or plastic 
(e.g., polystyrene, polypropylene, latex, etc.) beads. Patents teaching the use of such labels 
include U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; 

20 and 4,366,241. Means of detecting such labels are well known to those of skill in the art. 
Thus, for example, radiolabelsandchemiluminescentlabels may be detected using photographic 
film or scintillation counters, fluorescent markers may be detected using a photodetector to 
detect emitted light (e.g., as in fluorescence-activated cell sorting). Enzymatic labels are 
typically detected by providing the enzyme with a substrate and detecting the reactionproduct 

25 produced by the action of the enzyme on the substrate, and colorimetric labels are detected by 
simply visualizing the colored label. Thus, a label is any composition detectable by 
spectroscopic, photochemical, biochemical, immunochemical, electrical, optical or chemical 
means. Tlie label may be coupled directly or indirectly to the desired component of the assay 
according to methods well known in the art. Non-radioactive labels are often attached by 
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indirect means. Generally, a ligand molecule (e.g., biotin) is covalently boiind to the molecule. 
The ligand then binds to an anti-ligand (e.g., streptavidin) molecule which is either inherently 
detectable or covalently bound to a signal generating system, such as a detectable enzyme, a 
fluorescent compound, or a chemiluminescentcompound. A number of ligands and anti-ligands 
5 . can be used. Where a ligand has a natural anti-ligand, for example^ biotin, thyroxine, and 
Cortisol, it can be used in conjunction with the labeled, naturally^ occurring anti-ligands. 
Altematively, any haptenic or antigenic compound can be used in combination with an 
, antibody. The molecules can also be conjugated directly to signal generating compounds, e.g., 
by conjugation with an enzymie or'fluorophore. Means of detecting labels are well known to 
10 those of skill in the art. Thus, for example, where the label is a radioactive label, means for 
detection include a scintillation counter, photographic film as in autoradiography, or storage 
phosphor imaging. Where the label is a fluorescent label, it may be detected by exciting the 
fluorochrome with the appropriate wavelength of light and detecting the resulting fluorescence. 
The fluorescence may be detected visually, by means of photographic film, by the use of 
15 electronic detectors such as charge coupled devices (CCDs) or photomultipliers and the like. 
Similarly, enzymatic labels may be detected by providing the appropriate substrates for the 
enzyme and detecting the resulting reaction product. Also, simple colorimetric labels may be 
detected by observing the color associated with the label It will be appreciated that when pairs 
of fluorophores are used in an assay, it is often preferred that they have distinct emission 
20 patterns (wavelengths) so that they can be easily distinguished. 

As used herein, the tenri "substantially identical" in the context of comparing 
amino acid sequences, means that the sequences have at least about 70%, at least about 80%, 
or at lea;st about 90% amino acid residue identity when compared and aligned for maximum 
25 correspondence. An algorithm that is suitable for determining percent sequence identity and 
sequence similarity is the FASTA algorithm, which is described in Pearson, W.R. & Lipman, 
D.J., 1988, Proa Natl. Acad ScL U.S.A. 85: 2444. See also W. R. Pearson, 1996, Methods 
Enzymol. 266: 227-258. Preferred parameters used in a FASTA alignment of DNA sequences 
to calculate percent identity are optimized, BL50 Matrix 15: -5, k-tuple = 2; joining penalty 
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= 40, optimization = 28; gap penalty -12, gap length penalty =-2; and width =16. 

As used herein, "hematopoietic cells" include leukocytes includingly mphocy tes 
(T cells, B cells and NK cells), monocytes, and granulocytes (i.e., neutrophils, basophils and 
5 eosinophils), macrophages, dendritic cells, megakaryocytes, reticulocytes, erythrocytes, and 
CD34^ stem cells. 

As used herein, the terms "test compound" or "test agent" are used 
interchangeably and refer to a candidate agent that may have enhancer/agonist, or 

10 inhibitor/antagonist activity, e.g., inhibiting or enhancing an interaction such as PDZ-PL 
binding. The candidate agents or test compounds may be any of a large variety of compounds, 
both naturally occurring and synthetic, organic and inorganic, and including polymers (e.g., 
oligopeptides, polypeptides, oligonucleotides, and polynucleotides), small molecules,, 
antibodies (as broadly defined herein), sugars, fatty acids, nucleotides and nucleotide analogs, 

15 analogs of naturally occurring structures (e.g., peptide mimetics, nucleic acid analogs, and the 
like), and numerous other compounds. In certain embodiment, test agents are prepared from 
diversity libraries, such as random or combinatorial peptide or non-peptide libraries. Many 
libraries are known in the art that can be. used, e.g., chemically synthesized libraries, 
recombinant (e.g., phage display libraries), and in vitro translation-based libraries. Examples 

20 of chemically synthesized libraries are described in Fodor et al., 1991, Science 251:767-773; 
Houghtenet al., 1991,A^t//z/re354:84-86;Lamet al., 1991,7Va/wre354:82-84; Medynsk^ 1994, 
Bio/Technology 12:709-710; Gallop et al., 1994, J. Medicinal Chemistry 37(9):1233-1251; 
Ohlmeyer et aL, 1993, Proc. Natl Acad Scl USA 90:10922-10926; Erb et al., \99A,Proc. Natl 
Acad, Set USA 91:1 1422-1 1426; Houghten et al, 1992, Biotechniques 13:412; Jayawickreme 

25 et al., 1994, Proc. Natl Acad. Sci. USA 91:1614-1618; Salmon et aL, 1993, Proc. Natl Acad. 
Sci. USA 90: 1 1 708- 1 1 71 2; PCX PublicationNo. WO 93/20242; and Brenner and Lerner, 1 992, 
Proc. Natl Acad. ScL USA 89:5381-5383. Examples of phage display libraries are described 
m Scott and Smith, 1990, Sconce 249:386-390; Devlin et al., 1990, Science, 249:404-406; 
Christian, R.B., et al., 1992, J. Mol Biol 227:711-718); Lenstra, 1992, J. Immunol Meth. 
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152: 149- 157; Kay etal., 1993, Gene 128:59-65;and PCT Publication No. WO 94/1 83 18 dated 
August 18, 1994. In vitro translation-based libraries include but are not limited to those 
described in PCT Publication No. WO 91/05058 dated April 18, 1991; and Mattheakis et al., 
1994, Proc. Natl. Acad ScL f/&4 91:9022-9026. By way of examples of nonpeptide libraries, 
5 a benzodiazepine library (5^ee^.g.,Bumn etal., 1994, Proc.A/ar/.yicaci C/iS^ 91:4708-4712) 
can be adapted for use. Peptoid libraries (Simon et al., 1992, Proc. Natl. Acad. ScL USA 
89:9367-9371) can also be used. Another example of a library that can be used, in which the 
amide fimctionalitiesin peptides have been permethylated to generate a chemically transformed 
combinatorial library, is described by Ostresh et al. (1994, Proc. Natl. Acad. ScL USA 
10 91:11138-11142). 

The term "specific binding" refers to binding between two molecules, for 
example, a ligand and a receptor, characterized by the ability of a molecule (ligand) to associate 
with another specific molecule (receptor) even in the presence of many other diverse molecules, 
15 i.e., to show preferential binding of one molecule for another in a heterogeneous mixture of 
molecules. Specific binding of a ligand to a receptor is also evidenced by reduced binding of a 
detectably labeled ligand to the receptor in the presence of excess unlabeled ligand (i.e., a 
binding competition assay). 

20 As used herein, a "plurality" of PDZ proteins (or corresponding PDZ domains 

or PDZ fusion polypeptides) has its usual meaning. In some embodiments^ the plurality is at 
least 5, and often at least 25, at least 40, or at least 60 different PDZ proteins. In some 
embodiments, the plurality is selected from the list of PDZ polypeptides listed in TABLE 6. 
In some embodiments, the plurality of different PDZ proteins are from (i.e., expressed in) a 

25 particular specified tissue or a particular class or type of cell. In some embodiments, the 
plurality of different PDZ proteins represents a substantial , fraction (e.g., typically at least 
50%, more often at least 80%) of all of the PDZ proteins known to be, or suspected of being, 
expressed in the tissue or cell(s), e.g., all of the PDZ proteins known to be present in 
lymphocytes or hematopoetic cells. In some embodiments, the plurality is at least 50%, 
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usually at least 80%, at least 90% or all of the PDZ proteins disclosed hereinas being expressed 
in a particular cell. 

When referring to PL peptides (or the corresponding proteins, e.g., 
corresponding to those listed in TABLE 3, or elsewhere herein) a "plurality" may refer to at 
5 least 5, at least 10, and often at least 25 PLs such as those specifcally listed herein, or to the 
classes and percentages set forth supra for PDZ domains. 

II. Overview 

The present inventors have identified a large number of interactions between 

10 PDZ proteins and GPCR proteins containing a PL motif. These interactions can play a 
significant role in the biological function of certain cells and in a variety of physiological 
systems. As used herein, the term "biological function" in the context of a cell, refers to a 
detectable biological activity normally carried out by the cell, e.g., a phenotypic change such 
as proliferation, cell activation (e.g., T cell activation, B cell activation, T-B cell conjugate 

1 5 formation), cytokine release, degranulation, tyrosine phosphorylation, ion (e.g., calcium) flux, 
metabolic activity, apoptosis, changes in gene expression, maintenance of cell structure, cell 
migration, adherence to a substrate, signal transduction, cell-cell interactions, and others 
described herein or known in the art. 

Because the interactions involve proteins which are involved in diverse 

20 physiological systems (see Background section supra), the methods provided herein can be 
utilized to broadly or selectively modulate a number of different biological functions. Methods 
are also disclosed herein for determining whether a test compound acts as a modulator of 
binding between a particular PDZ protein and PL protein binding pair. Both agonists and 
antagonists of the binding pairs can be identified by such screening methods. Modulators so 

25 identified can subsequently be formulated as a pharmaceutical composition and used in the 
treatment of various diseases that are correlated with binding between aparticular PDZ protein 
and PL protein or set of such proteins. 

III. PDZ Protein and PL Protein Interactions 
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TABLE 2 lists PDZ proteins and GPCR-PL proteins which the current 
inventors have identified as binding to one another. Each page of TABLE 2 includes six 
columns. The columns are numbered from left to right such that the left-most column is column 
1 and the right-most column is column 6. Thus, the first column is labeled "AVC PL ID" and 
5 lists AA numbers that serve as unique internal designations for each PL peptide. These ID 
numbers correspond to those listed in column 6 of TABLE 3. The second column is labeled 
"PL" and lists the various PL proteins/PDZ-Ligands that were examined. This column lists 
gene abbreviations, with full gene names listed in parentheses, for peptides corresponding to 
the carboxyl-terminal 20 amino acids of the protein listed. The third column, labeled "PL 

10 20mer Sequence," lists the carboxyl-terminal 20 amino acids of the protein. All ligands are 
biotinylated at the amino-terminus. Some have been modified to eliminate cysteine amino acids 
from the 20mer sequence. Modifications have been noted in column 2, and wildtype sequences 
are presented in TABLE 3. 

The PDZ protein (or proteins) that interact(s) with a PL peptide are listed in 

15 the fourth column that is labeled "PDZ". This column provides the gene name for the PDZ 
portion of the GST-PDZ fusion that interacts with the PDZ-ligand to the left. For PDZ 
domain-containing proteins with multiple domains the domain number is listed to the right of 
the PDZ (i.e., in column 5 labeled "PDZ Domain"), and indicates the PDZ domain number 
when nunibered from the amino-terminus to the carboxy-terminus. 

20 ' The sixth colurrin labeled "Binding Strength" is another measure of the level of 

s 

binding. In particular, it provides an absorbance value at 450 nm which indicates the amount 
of PL peptide bound to the PDZ protein. The following numerical values have the following 
meanings: 'V - A450nm 0-1; - A450nm 1-2; '3'- A450nm 2-3; '4' - A450nm 3-4; '5' - 
, A450nm of 4. All interactions have been repeated a total of at least 4 times, and all show 
25 A450nm values that are at least two times that of controls. 

Further information regarding these PL proteins and PDZ proteins is provided 
in TABLES 3 and 6. In particular, TABLE 3 provides a partial listing of known G-protein 
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coupled receptors, along with the amino acid sequence of the carboxyl-terminal20 amino acids.. 
When numbered from left to right, the first column labeled "Gene Name(Synonyms)" provides 
the most commonly used name for that gene, with synonyms or acronyms listed in 
parentheses. Genbank reference numbers (Accession number and GI number) are listed in 
5 column 2, labeled "Genbank Reference." Columns 3 and 4, labeled "Last 20 aa" and "Last 4 aa," 
respectively, list the last 20 amino acids, and the last 4 amino acids of each protein. Column 
5, labeled "PL?" marks with an "X" those carboxy-terminal sequences that are predicted to 
display a classic PL amino acid motif. Many of the carboxyl-terminal motifs that are not 
marked in column 6 may exhibit binding to PDZ proteins, and the designation as a classic PL 

10 motif is in no way intended to predictor restrict GPCR binding patterns to PDZ proteins. The 
sixth column labeled "AVC PL ID" provides the internal designation number used to refer to 
a particular PL protein and correlates with the designation used in column 1 of TABLE 2. 
Those PL proteins that have been assigned an internal AVC ID 

Many of the genes listed in TABLE 3 express more than one amino acid 

1 5 sequence^ depending on alternative exon splicing and single amino acid changes. The Genbank 
reference presented is intended to represent one isoform of the gene listed. Alternatively 
spliced and point mutated isoforms of the genes listed in TABLE 3 are given separate Genbank 
references, and have been omitted for the purpose of brevity . In all cases, the carboxyl-terminal 
sequence is unaffected by these changes. In cases, where the carboxyl-terminal sequence is 
- 20 affected by point mutations or alternative splicing, both isoforms have been included. In 
addition, those GPCR's for which no known receptor ligand has been identified (so-called 
"orphan GPCR's") have also been omitted. As indicated supra, all peptides are biotinylated 
at the amino terminus and the amino acid sequences correspond to the C-terminal sequence of 
the gene name listed to the left. 

25 ' TABLE 6 Hsts the sequences of the PDZ domains cloned into a vector (PGEX- 

3X vector) for production of GST-PDZ fusion proteins (Pharrhacia), More specifically, the 
first column (left to right) entitled "PDZ Gene Name" lists the name of the gene containing the 
PDZ domain. The second column labeled "GI or Acc#" is a unique Genbank identifier for the 
gene used to design primers for PCR amplification of the listed sequence. The next column 
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labeled "Domain#" indicates the Pfam-predicted PDZ domain number, as numbered from the 
amino-terminus of the gene to the carboxy-terminus. The last column entitled "Sequence fused 
to GST construct" provides the actual eimino acid sequence inserted into the GST-PDZ 
expression vector as determined by DNA sequencing of the constructs. 
5 As discussed in detail herein, the PDZ proteins listed in TABLES 2 and 6 are 

naturally occurring proteins containing a PDZ domain. Only significant interactions are 
presented in this table. Thus, the present invention is particularly directed to the detection and 
modulation of interactions between a PDZ protein and PL protein pair listed in TABLE 2. As 
used herein the phrase "protein pair" or 'protein binding pair" when used in reference to a PDZ 

10 protein and PL protein refers to a PD protein and PDZ protein listed in TABLE 2 which bind 
to one another. It should be understood that TABLE 2 is set up to show that certain PL 
proteins bind to a plurality of PDZ proteins. For example, PL protein AA329 binds to PDZ 
proteins KIAA0973 and KIAA0807. 

Interactions between GPCR proteins containing a PDZ ligand and PDZ 

15 proteins are not limited to those listed in TABLE 2. TABLE 4 presents a list of interactions 
between GPCR proteins and PDZ proteins. When numbered from left to right, the first 
column, labeled "GPCR gene," lists the GPCR protein that binds to a PDZ domain-containing 
protein. The second column labeled "PDZ-containingGene" lists the specific PDZ-containing 
gene that binds to the GPCR gene listed to the left. The PDZ domain that binds to the GPCR 

20 is listed in column 3, labeled "PDZ domain(s)." These interactions were confirmed using 
assays other than the "G" or "A" assays described infra^ and suggest that changes in PDZ-PL 
binding patterns may occur with the use of different assays or with the use of assay variations 
described infra. 

The interactions summarized in TABLE 2 can occur in a wide variety of cell 
25 types. Examples of such cells include hematopoietic, stem, neuronal, muscle, epidermal, 
epithelial, endothelial, and cells from essentially any tissue such as liyer, lung, placenta, uterus, 
kidney, ovaries, testes, stomach, colon and intestine. Because the interactions disclosed herein 
can occur in such a wide variety of cell types, these interactions can also play an important role 
in a variety of biological functions. 
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Thus, for example, in certain embodiments of the invention, the PL proteins of 
the invention bind a PDZ protein expressed in T lymphocytes, B lymphocytes, or both T and 
B lymphocytes. In an embodiment, the PL protein binds a PDZ protein expresised in 
endothelial cells. In various embodiments, the PL proteins and/or the PDZ protein to which 
5 it binds are not expressed in the nervous system (e.g., neurons). In still other instances the PL 
protein binds a PDZ protein that is expressed in neuronal cells. 

In various embodiments of the invention, the PL protein is expressed or up- 
regulated upon cell activation(e.g., in activatedB lymphocytes, T lymphocytes) or upon entry 
into mitosis (e.g., up-regulation in rapidly proliferating cell populations). 

10 In certain other various embodiments of the invention, the PL protein is (i) a 

protein that mediates immune cell (e.g., hematopoietic cell) activation or migration, (ii) a protein 
that does not mediate apoptosis in a cell type, (iii) a protein that is a G-protein coupled seven 
transmembrane helix receptor but not a serotonin receptor, (iy) a protein that is G-protein 
coupled seven transmembrane helix receptor but not a cytokine receptor, or (v) a protein that 

15 is a G-protein coupled seven transmembrane helix receptor and is a cytokine receptor. 

IV. Classification of Interactions 

20 A. General 

The interactions summarized in TABLE 2 can occur in a wide variety of cell 
types. Examples of such cells include hematopoietic, stem, neuronal, muscle, epidermal, 
eipthelial, endothelial, and cells from essentially any tissue such as liver, ling, placenta, uterus, 
kidney, ovaries, testes, stomach, colon and intestine. Because the interactions disclosed herein 

25 can occur in such a wide variety of cell types, these interactions can also play an important role 
in a variety of biological functions. Consequently, modulationof the interactions between PDZ 
proteins and PL proteins that are described herein can be utilized to regulate biological function 
in a v^ade range of cells. 
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B. Exemplary PDZ Classification 

The PDZ proteins identified herein as interacting with particular PL proteins 
can be grouped into a number of different categories. Thus, as described in greater detail below, 
the methods and reagents that are provided herein can be utilizedto modulate PDZ interactions, 
5 and thus biological functions, that are regulated or otherwise involve the following classes of 
proteins. It should be recognized, however, that modulation of the interactions that are 
identified herein can be utilized to affect biological functions involving other protein cleisses. 

L Protein mnases 

10 A number of protein kinases contain PDZ domains. Protein kinases are widely 

involved in cellular metabolism and regulation of protein activity through phosphorylation of 
amino acids on proteins. An example of this is the regulation of signal transduction pathways 
such as T cell activation through the T cell Receptor, where Z AP-70 kinase function is required 
for transmission of the activation signal to downstream effector molecules. These molecules 

15 include, but are not limited to KIAA0303. KIAA056L KIAA0807. KIAA0973, and CASK. 

2. Guahalyte Kinases 

A number ofguanalyte kinases contain PDZ domains. These molecules include- 
but are not limited to Atrophin L CARDIL CARD14, DLGL DLG2, DLG5> FLJ12615, 
20 MPPl, MPP2. NeDLa p55T- PSD95. ZO-L ZO-2- and ZO-3. 

3. Guanine Exchange Factors 

A number of guanine exchange factors contain PDZ domains. Guanine exchange 
factors regulate signal transductionpathwavs and other biologicalprocesses through facilitating 
25 the exchange of differently phosphorvlated guanine residues. These molecules include, but are 
not limited to GTPase. Guanine Exchange, KIAA0313. K1AA0380,KIAA0382, KIAA1389, 
KIAA1415, TIAMl, and TAIM2. 

4. LIM PDZ^s 
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A number of LIM PDZ's contain PDZ domains. These molecules include, but 
are not limited to a-Actinin 2. ELFIN 1. ENIGMA. HEME A 1003117. KIAAQ613. 
KIAA0858. KIAA0631.LIM Mystique. LIM protein. LIM-RIL.LIMK1.LIMK2. andLU-1. 

5 5, Proteins Containing Only PDZ Domains 

A number of proteins contain PDZ domains without any other predicted 
functional domains. These include, but are not limited to 26s subunit p27. AIPC. Cytohesion 
Binding Protem. EZRIN Binding Protein. FLJOOOIL FLJ20075. FLJ21687. GRIPl. 
HEMBA1000505.KIAA0545. KIAA0967.KIAA1202.KIAA1284. KIAAl 526. KIAA 1620. 
10 KIAA 171 9. MAGI 1. Novel PDZ gene. Outer Membrane. PAR3. PAR6. PAR6y. PDZ-73. 
PDZKl. PICKl. PIST. prIL16. Shankl. SIPl. SITAC-18. Svnteniri. Svntrophin 72. TIPl. 
TIP2. and TIP43. 

6. Tyrosine Phosphatases 
15 A number of Tyrosine phosphatases contain PDZ domains. Tyrosine 

phosphatases regulate biological processes such as signal transduction pathways through 
removal of phosphate groups required for function of the target protein. Examples of such 
enzymes include, but are not limited to PTN-3. PTN-4, and PTPLL 

20 7. Serine Proteases 

A number of serine proteases contain PDZ domains. Proteases affect biological 
molecules bv cleaving them to either activate or repress their functional ability. These enzymes 
have a variety of functions, including roles in digestion, blood coagulation and lysis of blood 
clots. These include, but are not limited to Novel Serine Protease and Serine Protease. 
25 . ^ 

8. Viral Oncogene Interacting Proteins that Contain PDZ Domains 
A number of TAX interacting proteins contain PDZ domains. Many of these 
also bind to multiple viral oncoproteins such as Adenovirus E4. Papillomavirus E6. and HB V 
protein X. These include, but are not limited to AIPC. Connector Enhancer. DLGl. DLG2. 
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ERBIN. FLJOQQIL FLJ11215. HEMBA10031 17, INADL, KIAA0147, KIAA0807. 
KIAA1526, KIAA1634. LIMKl. LIM Mystique, LIM-RIL, MUPPl. NeDLG. Outer 
Membrane, PSD95, PTN-4, PTPL-L Svntrophin yh Svntrophin72, TAX2-like protein. TIP2. 
TIP1.TIP33, and TIP43. 

5 . 

9. Proteins Containing RA and/or RHA and/or DIL and/or IGFBP and/or 
WW and/or L27 and/or SAM and/or PH and/or DIX and/or DIP and/or Dishevelled and/or LRR 
and/or Hormone 3 and/or C2 and/or RPH3 A and/or zf-TRAF and/or zf-C3HC4 and/or PIP 
and/or NO Synthase and/or Flavodoxin and/or FAD binding and/or NAD binding and/or Kazal 
10 and/or Trypsin and/or RBD and/or RGS and/or GoLoco and/or HRl and/or BROl That Contain 
PDZ Domains 

A number of proteins containing RA and/or RHA and/or DIL and/or IGFBP 
and/or WW and/or L27 and/or SAM and/or PH and/or DIX and/or DIP and/or Dishevelled 
and/or LRR and/or Hormone 3 and/or C2 and/or RPH3A and/or zf-TRAF and/or zf-C3HC4 

1 5 and/or PID and/or NO Synthase and/or Flavodoxin and/or FAD binding and/or NAD binding 
and/or Kazal and/or Trypsin and/or RBD and/or RGS and/or GoLoco and/or HRl and/or BRO 1 
contain PDZ domains. These include, but are not limited to AF6, APXL-1, MAGIl, DVLl, 
DVL2, DVL3, KIAA0417, KIAA0316, KIAA0340, KIAA0559, KIAA0751, KIAA0902, 
KIAA1095, KIAA1222, KIAA1634, MINTl, NOSl, RGS12, Rhophilin-like, Shank 3, 

20 Syntrophin la, Syntrophin P2, and XI 1 (3. 

C, Exemplary PL Classification 

The GPCR-PL proteins involved in the interactions listed in TABLE 2 are from 
a number of different classes. Consequently, the methods and reagents that are disclosed herein 
25 can .be utilized to modulate interactions involving the following classes of GPCR-PL proteins 
to modulate a biological function in cells, but are not intended to be limiting in scope of 
biological processes or diseases affected. The following classes, however, should not be 
considered exhaustive of the^ypes of classes of GPCR proteins whose activity can be 
modulated using the methods and reagents that are provided herein. 



L Serotonin Receptors 

Serotonin receptors are involved in a variety of physiological 
functions, including nociception, motor control, endocrine secretion thermoregulation, 
5 appetite, control of exchanges between the central nervous system and cerebospinal fluid, 
prostate cancer, hormone overproduction by endocrine tumors, migrane, irritable bowel 
syndrome, Alziemer's disease, drug withdrawls, and a number of psychological disorders, 
including but not limited to depression, obsessive compulsive disorder, schizophrenia, and 
anxiety.. Representative members of this family include, but are not limited to, 5-HTl A, 5- 
10 HTIB, 5-HTlD, 5HT1F, 5-HT2A, 5-HT2B, 5-HT2C, 5-HT4, 5-HT5A, 5-HT6, and 5- 
HT7. Modulation of serotonin receptor interactions with PDZ proteins may provide an 
effective means for treating a number of diseeises, including but not limited to those listed 
above. \ 

15 2. Histamine Receptors 

Histamine receptors are involved in histamine responses, and affect 
several systems that result in asthma, allergy and inflammation responses. In addition, 
histamine receptors have been implicated in anaphylaxis, rhinoconjunctivitis, and 
Gastroesophageal reflux disease (GERD). Representative GPCRs include, but are not 

20 limited to, Histamine HI receptor, histamine H2 receptor, histamine H3 receptor, and 

histmane H4 receptor. Modulation of histamine receptor interactions with PDZ proteins 
may provide effective treatments for these and many other diseases. 

3. Acetylcholine Receptors 
25 Acetylcholine receptors are involved in activation of neurons. 

Inappropriate activation can lead to Parkinson's like symptoms in animal models, increased 
metabolic activity, increased cardiac activity, epilepsy, and psuchological disorders and 
responses. Representative members include, but are not limited to, ACMl, ACM2, ACM3, 
ACM4 and ACM5. 
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4. Adrenoceptors 

Adrenoceptors are involved a number of biological process, including 
synaptic plasticity, long term potentiation, inflammation, asthma, obesity, rheumatoid 
5 arthritis, overactive bladder disorder, and hypertension. In addition, these receptors have 
been implicated in heroin addiction, chronic heart failure, and other cardiovascular diseases. 
Representative members of this family include the betal-, beta2-, betaS- and beta4- 
adrenergic receptors, and the alpha 1- and alpha2- adrenergic receptors. Modulation of 
adrenergic receptor interactions with PDZ proteins may provide effective treatments for the 
10 diseases Usted above and other cardiovascular diseases. 

5. Dopamine Receptors 

Dopamine receptors are known to , be essential for normal 
neurotransmission. Abnormalities in dopamine receptor function or localization can result in 

15 a number of neurological diseases, including but not limited to Parkinson's disease, 
schizophrenia, and Attention Deficit Hyperactivity Disorder (ADHD), -Representative 
members of this group include but are not limited to Dopamine 1 receptor, Dopamine 2 
receptor, Dopamine 3 receptor. Dopamine 4 receptor, and Dopamine 5 receptor. Modulation 
of dopamine receptor interactions with PDZ proteins may provide effective treatments for a 

20 variety of neurological disorders, including those listed above. 

6. Bradykinin Receptors 

Bradykinin receptors are involved in a number of biological functions, 
including inflammation, tissue injury, asthma, perennial rhinitis, diabetes, and brain edema. 
25 Bradykinin receptors have also been implicated in various cardiovascular diseases, including 
hypertension, myocardial hypertrophy, myocardial infarction, and arrhythmias. 
Representative members of this group include but are not limited to B 1 bradykinin receptor and 
B2 bradykinin receptor. Modulation of bradykinin receptor interactions with PDZ proteins 
may provide effective treatments for many diseases, including those listed above. 
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7, Anaphylatdxin Chemotactic Receptors 

Anaphylatoxin chemotactic receptors and their homoiogues form a 
group that is highly involved in the inflammatory response, and is involved in other 
5 biological functions to a lesser degree. Representative members of this group include C5a 
anaphylatoxin chemotactic receptor and C3a anaphylatoxin chemotactic receptor. 
Modulation of anaphylatoxin chemotactic receptor interactions with PDZ proteins may 
provide potent therapies for inflammation. 

10 8, Interleukin 8 Receptors 

Interleukin 8 receptors play a role in lung disease, multiple myeloma, 
and inflammation. Representative members of this group include IL8RA £uid IL8RB. 
Modulation of interleukin receptor interactions with PDZ proteins may provide effective 
treatments for these and other diseases. 

15 

9, Fmet-leu-phe Receptors 

Fmet-leu-phe (FMLF or FMLP) receptors are receptors to 
chemoattractants, and thus are highly involved in inflammation, tissue injury and repair, and 
phygocytosis of foreign bacteria or microbes. Representative members of this group include 
20 but are not limited to FMLP receptor I and FMLP receptor II. Modulation of FMLP receptor 
interactions with PDZ proteins may provide effective means for regulating chemotaxis and 
inflammation, and for treating bacterial or viral infections. 



1 0. Angiotensin Receptors 
25 Angiotensin receptors are known to be involved in diabetes, 

hypertension, cardiovascular disease, renal disease, proteinuria and other diseases. 
Representative members of this group include but are not limited to type 2 angiotensin II 
receptor, type lA angiotensin II receptor, and type IB angiotensin II receptor. Modulation 
of angiotensin receptor interactions with PDZ proteins may provide effective treatments for 
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many diseases, including those listed above. 

1 1 . Endothelin Receptors 

Endothelin receptors play a role in a variety of biological functions, 
5 including a major role in the female reproductive cycle. In addition, these receptors have been 
implicated in a number of diseases, including glaucoma, hypertension, congestive heart failure, 
and cerebral vasospasm. Representative members of this group include but are not limited to 
endothelin A receptor and endothelin B receptor. Modulation of endothelin receptor 
interactions with PDZ proteins may provide effective treatments for a variety of diseases, 
10 including those listed above. 

12. Melanocortin Receptors 

Melanocortin receptors are known to be involved in obesity, anorexia 
nervosa, nociception, and a variety of other biological processes or disorders. Representative 
15 members of this group include but are not limited to adrenocorticotropic hormone receptor, 
melanocortin receptor 2, melanocortin receptor 3, melanocortin receptor 4, melanocortin 
receptor 5, and melanocyte stimulating hormone receptor. Modulation of melanocortin 
receptor interactions with PDZ proteins may provide effective treatment for diseases such as 
obesity and anorexia nervosa. 

20 

13. Neuropeptide Y Receptors 

Neuropeptide Y receptors are known to be involvied in a number of 
biological functions and diseases, including stress, cardiovascular disease, feeding disorders, 
seizures, hypertension, obesity, anxiety, diabetes, and intestinal disorders. Representative 
25 members of this group include but are not limited to Neuropeptide Y receptor type 1, 
Neuropeptide Y receptor type 2, Neuropeptide Y receptor type 4, and Neuropeptide Y 
receptor type 5. Modulation of neuropeptide receptor interactions with PDZ proteins may 
provide effective treatments for those diseases listed above and many others. 
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14. Neurotensin Receptors 

Neurotensin receptors are involved in a variety of diseases, including 
psychological disorders such as Parkinson's disease and schizophrenia. Representative 
members of this group include but are not limited to neurotensin receptor type 1 and 
neurotensin receptor type 2. Modulation of neurotensin receptor interactions with PDZ 
proteins may provide effective treatment for psychological disorders and other diseases. 

15. Opioid Receptors 

Opioid receptors are involved in a variety of diseases, including but not 
limited to polycystic ovarian syndrome, irritable bowel syndrome, heroin addiction, and ileus. 
Representative members of this group include mu-opioid receptor, delta-opioid receptor, 
kappa-opioid receptor, and nociceptin receptor. Modulation of opioid receptor interactions 
with PDZ proteins may provide effective treatments for these and many other diseases. 

16. Somatostatin Receptors 

Somatostatin receptors are involved in the modulation of endocrine 
and exocrine functions in both nervous and non-nervous tissues, and plays a role in obesity, 
diabetes mellitus, acromegaly, and many other diseases. Representative members of this 
family include sstl, sst2A, sst2B, sst3, sst4 and sst5. Modulation of somatostatin receptor 
interactions with PDZ proteins may provide effective treatments for these diseases and 
various cancers, due to somatostatin receptor overexpression on many types of tumors. 

17. Tachykinin Receptors 

Tachykinin receptors are involved in a number of diseases and disorders, such 
as incontinence, migrane, fibromyalgia, asthma, emesis, psoriasis, central nervous system 
disorders, and gastrointestinal diseases. Representative members of this group include 
Substance P receptor, Substance K receptor. Neuromedin K receptor 3, and Neuromedin K 
receptor 4. Modulation of tachykinin receptor interactions with PDZ proteins may provide 
effective treatment for those diseases listed above and many others. 
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1 8. Vasopressin-like Receptors 

Vasopressin-like receptors are involved in many biological functions, 
including reproductive regulation and water metabolism. Oxytocin receptor is highly 
5 involved in the reproductive system, regulating parturition, lactation, and other reproductive 
functions. Representative members of this group include Vasopressin VIA, Vasopressin 
VIB, Vasopressin V2, and Oxytocin receptor. Modulation of vasopressin-like receptor 
interactions with PDZ proteins may provide an effective means for regulating, among 
others, reproductive function and water metabolism. 

10 

19. Galanin-like Receptors 

Galanin-like receptors are involved in a variety of diseases and 
disorders, including obesity, Alzheimer's disease, epilepsy, eating disorders, and depression. 
Representative members of this group include but are not limited to Galanin receptor type 1, 
15 and Galanin receptor type 2, Modulation of galanin-like receptor interactions with PDZ 
proteins may provide effective treatments for many diseases, including those listed above. 

20. Proteinase-activated like Receptors 

Proteinase-activated like receptors are involved in vascular and 
20 cardiovascular disease, cancer, gastrointestinal disease and inflammation. Representative 
members of this group include but are not limited to Proteinase-activated receptor 2, 
Proteinase-activated receptor 3, and Thrombin receptor. Modulation of proteinase-activated 
like receptor interactions with PDZ proteins may provide effective treatments for those 
diseases listed above, in addition to many others. 

25 

2 1 . Orexin & Neuropeptide FF Receptors 

Orexin and Neuropeptide FF receptors are involved in many diseases, 
such as eating disorders and narcolepsy. Representative members of this group include but are 
not limited to Neuropeptide FF receptor 1 , Neuropeptide FF receptor 2, Orexin receptor 1 , and 
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Orexin receptor 2. Modulation of orexinand neuropeptide FF receptor interactions witli PDZ 
proteins may provide effective treatments for many diseases, including those listed above. 

22. Urotensin II Receptors 

5 ' 

23. AdrenomeduUin Receptors 
10 24. Endothelin B-like Receptors 

25. Chemokine Receptors 

Chemokine receptors and their homologues form a group that is 
15 involved in many biological processes, including but not limited to immunosurveillance, 
inflammation, viral infection, lung disease, graft/transplant rejection, HIV infection, 
autoimmune disease, angiogenesis, tumorigenesis, wound healing, and metastasis. 
Modulation of chemokine receptor interactions with PDZ proteins may provide effective 
treatments for these and other diseases. 

20 

26. Neuromedin U Receptors 

27. Hormone Receptors ^ 

25 Hormone receptors are involved in a number of endocrine functions 

and diseases, including but not limited to Graves* disease, autoimmune hypothyroidism, and 
thyroid cancer. Representative members of this group include follicle-stimulating hormone 
receptor, lutropin-choriogonadotropic hormone receptor, thyrotropin receptor, luteinizing 
hormone receptor, and gonadotropin receptor. Modulation of hormone receptor 
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interactions with PDZ proteins may provide effective treatments for these and other 
endocrine diseases. 

28. Rhodopsin Receptors 

5 Rhodopsin receptors are highly involved in the visual system, regulating 

signal transduction in response to light stimuli. Representative members of this group include 
but are not limited to blue-sensitiveopsin receptor, red-sensitive opsin receptor, green-sensitive 
opsin receptor, and rhodopsin. Modulation of rhodopsin receptor interactions with PDZ 
proteins may provide treatment for many diseases of the visual system. 

10 

29. Olfactory Receptors 

Olfactory receptors are involved primarily in the sense of smell. 
Representativemembersofthis groupincludeORlAl,ORlCl, OR2A4, OR2B2,OR2Wl,and 
OR2J3, in addition to many others. Modulation of olfactory receptor interactions with PDZ 
15 proteins may provide treatments for temporary loss of smell and permanent anosmia. 

30. Adenosine Receptors 

Adenosine receptors and their homologues form a group that is 
involved in renal disease, asthma, Parkinson's disease, and many other diseases. 
20 . Representative members of this group include, but are not limited to Adenosine Al 

receptor, Adenosine A2A receptor. Adenosine A2B receptor, and Adenosine A3 receptor. 
Modulation of adenosine receptor interactions with PDZ proteins may provide effective 
treatments for the diseases listed above, and others. 

25 31. ^ Cannabis Receptors 

Cannabis receptors have been implicated in psychological disorders, 
hypotension, cardiovascular regulation, pain regulation, movement, memory, and appetite. In 
addition, they have been investigated as potential therapies for Huntington's Disease, 
Parkinson's disease, schizophrenia, and tremor. Representative members of this group include 
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but are not limited to Cemnabinoid receptor 1 and Cannabinoid receptor 2. Modulation of 
cannabis receptor interactions with PDZ proteins may provide therapies such as those listed 
above. 

32. Platelet Activating Factor Receptors 

3 3 . Gonadotropin-releasing hormone Receptors 

34. Thyrotropin-releasing hormone & Secretagogue Receptors 
Thyrotropin-releasirig hormone & secretagogue receptors are known to 

be involved in many thyroid diseases, including hypo- arid hyperthyroidism, amyotrophic 
lateral sclerosis (ALS), obesity, and gastrointestinal disorders such as inflammatory bowel 
disease and ulcerative colitis. Representative members of this group include but are not limited 
to Growth Hormone Secretagogue receptor type 1 , Motilin receptor, and thyrotropin-releasing 
hormone receptor. Modulation of thyrotropin-releasing hormone & secretagogue receptor 
interactions with PDZ proteins may provide relief for the 

35. Melatonin Receptors 

Melatonin receptors are most commonly recognized for their role in the 
circadian rhythm, however, these receptors also play a role in the cerebrovascular, reproductive, 
visual, neuroendocrine, and neuroimmunological systems. In addition, they are associated with 
cancer, rheumatoid arthritis, and reduction of NSAID-caused lesions. Representative members 
of this group include but are not limited to melatonin receptor lA, melatonin receptor IB, and 
melatonin-related receptor. Modulation of melatonin receptor interactions with PDZ proteins 
may provide effective therapies and treatments for a variety of diseases, including those listed 
above. 
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36. Lysosphingolipid & LP A (EDG) Receptors 



37. Leukotrine Receptors 

5 

3 8 , Calcitonin Receptors 

Calcitonin receptors play a role in bone mineral density, osteoporosis, 
and prostate cancer. In addition, calcitonin receptors have been implicated in renal function, 
10 embryonic development, and sperm function and physiology. Modulation of calcitonin 
receptor interactions with PDZ proteins may provide an effective means for treating diseases 
such as osteoporosis or prostate cancer. 

39. Corticotropin-releasing factor Receptors 
Corticotropin-releasing factor (CRF) receptors are known to be 

involved in the stress response, irritable bowel syndrome, obesity, depression, eating 
disorders, and cardiac and inflammatory diseases. Modulation of CRF receptor interactions 
with PDZ proteins may provide an effective means for treating stress and diseases 
associated with stress, including those listed above. 

40. Gastric Inhibitory Peptide Receptors 

4 1 . Glucagon Receptors 

25 Glucagon peptide and Glucagon-like peptide receptors form a group 

that is known to play a role in diabetes mellitus, obesity, and gastrointestinal repair and 
cytoprotection. In addition, glucagon receptors are integral to glucagonoma syndrome, 
which can be related to endocrine tumors. Modulation of glucagon receptor interactions with 
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20 
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PDZ proteins may provide an effective treatment for diabetes, obesity, glucagonoma, and 
disorders characterized by injury and/or dysfunction of the intestinal mucosal epithelium. 

42. Growth hormone-releasing hormone Receptors 

5 - 

43. Parathyroid hormone Receptors 

10 44. PACAP Receptors 

45. Secretin-like Receptors 

Representative membeirs of this group include but are not limited to 
15 gastric inhibitory peptide receptor, growth hormone-releasing hormone receptor, parathyroid 
hormone receptor, brain-specific angiogenesis inhibitor receptors, calcitonin receptors, CD97, 
cadherin EGF LAG receptor, corticotropin releasing factor receptors, cell surface glycoprotein 
EMRl, glucagon-like peptide receptors, Latrophilin-1 receptor, PACAP receptor, Lectomedin 
receptors, and VIP receptors. Modulation of secretin-like receptor interactions with PDZ 
20 proteins may provide effective treatment for a variety of diseases and disorders. 

46. Vasoactive intestinal polypeptide Receptors 

Vasoactive Intestinal Peptide (VIP) receptors play a role in a number 
of autoimmune diseases, including but not limited to septic shock, rheumatoid arthritis, 
25 multiple sclerosis, Crohn's disease, asthma, and autoimmune diabetes. In addition, VIP 
receptors are known to be involved in the inflammatory response and pulmonary 
hypertension, and are fundamental to Verner-Morrison syndrome. Modulation' of 
vasoactive intestinal peptide receptor interactions with PDZ proteins may provide an 
effective means for treating autoimmune diseases, affecting inflammatory responses, or 
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alleviating the symptoms of Vemer-Morrison Syndrome. 



47. Diuretic hormone Receptors 

5 

48. EMRl Receptors 

49. Latrophilin Receptors 

10 

50. Brain-Specific angiogenesis inhibitor (BAI^ Receptors 

51. Methuselah-like protein (MTH^ Receptors 

15 

52. Metabotropic glutamate receptors 

Metabotropic glutamate receptors are involved in inflammatory pain, 
anxiety, neurodegenerative diseases such as Parkinson's disease and Alzheimer's disease, 
20 brain ischemia, amyotrophic lateral sclerosis, and seizure disorders. Modulation of 

metabotropic glutamate receptor interactions with PDZ proteins may provide effective 
anticonvulsant and neuroprotective therapies and treatments for inflammatory and other 
disorders. 

25 53. GABA Receptors 

GAB A receptors, or gamma-aminobutyricacid receptors, play a critical 
role m the fine-tuning of central nervous system synaptic transmission and are attractive 
targets for the treatment of epilepsy, anxiety, depression, cognitive deficits, and nociceptive 
disorders. This family includes GBRl and GBR2. 
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54. Ocular Albinism protein Receptors 

55. Frizzled/Smoothened Receptors 



56. Vomeronasal Receptors 

57. Thromboxane Receptors 

1 0 Thromboxane receptors and their homologues form a group that is 

involved in inflammation, asthma, and cardiovascular disorders such as myocardial ischemia, 
hypertension, stroke, thrombosis, and restenosis. Modulation of thromboxane receptor 
interactions with PDZ proteins may provide effective treatments for many diseases, 
including but not limited to asthma, inflammation, and cardiovascular diseases. 

15 

58. Prostaglandin Receptors 

Prostaglandin receptors are involved in arthritis, insomnia, colori 
cancer, and many other disesises. Prostaglandin receptors also play a large role in vascular 
contraction and thus are important effectors in, among others, inflammation, myocardial 
20 ischemia, hypertension, stroke, and thrombosis. Modulation of prostaglandin receptor 
interactions with PDZ proteins may provide effective treatments for vascular diseases, 
arthritis, colon cancer, insomnia, and other diseases. 

59. GPCR Receptors Expressed on T cell surface 

25 GPCRs are used for a number of function on the surface, of T cells, 

including chemokine sensing, cytokine sensing, and environment sensing. Modulation of 
interactions between these receptors and PDZ proteins could be used to treat a wide variety 
of immune and inflammatory disorders. 
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GPCRs are used for a number of function on the surface of B cells,, 
including chemokine sensing, cytokine sensing, and environment sensing. Modulation of 
interactions between these receptors and PDZ proteins could be used to treat a wide variety 
5 of immune and inflammatory disorders. 

61. GPCR Receptors Expressed on NK cell surface 

10 

62. GPCR Receptors Expressed on Monocyte surface 
GPCR Receptors Expressed on Granulocyte surface 
GPCR Receptors Expressed on Endothelial Cell surface 

20 65. GPCR Receptors Involved in the Immune Response 

66. GPCR Receptors Involved in the Cardiovascular System 

25 

67. GPCR Receptors Involved in the Neural System 



63. 



15 



64. 



68. GPCR Receptors Involved in the Inflammatory Response 
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Many GPCRs are involvedin inflammatory responses, whether vascular, 
histamine related or other inflammatory responses. Modulation of PDZ:GPCRPL interactions 
could be used to treat these symptoms. 

69. GPCR Receptors Involved in Asthma and Allergic Inflammation 

A number of GPCR proteins containing a PL motif are involved in 
asthma and the allergic inflammatory response. These include, but are not limited to adrenergic 
receptors and leukotriene receptors. Modulation of PDZ:GPCRPL interactions could be used 
to treat these symptoms. 

70. GPCR Receptors Involved in Parkinson's Disease 

Glutamate, GABA and NMDA receptors have been implicated as 
potential drug targets that may slow progression of Parkinson's disease or treat the symptoms 
such that quality of life improves. 

71. Group 71 

Members of Group 71 include alpha 1 A- Adrenergic receptor, beta2- 
Adrenergic receptor, P2Y1 purinergic receptor, GRK6A , beta 1 -Adrenergic receptor, 
parathyroid hormone 1 receptor, parathyroid hormone 1 receptor, 5HT2B, platelet-derived 
growth factor receptor, mGLURla, mGLURS, SSTR2, SSTR2, IL8RB, CLl (a-latrotoxin), 
5HT2B, Bl AR, rat SSTR2, 5HT2C, SSTR2A, CIRLl, CIRL2, CIRLl & 2, prolactin-releasing 
peptide receptor, kappa opioid receptor, mGLUR7„ 

V. Detection of PDZ Domain-Containing Proteins 

As noted supra, the present inventors have identified a number of PDZ protein 
and PL protein interactions that can play a role in modulation of a number of biological 
functions in a variety of cell types. A comprehensive list of PDZ domain-containing proteins 
was retrieved from the Sanger Centre database (Pfam) searching for the keyword, "PDZ". The 
corresponding cDNA sequences were retrieved from GenBank using the NCBI "entrez" 
database (hereinafter, "GenBank PDZ protein cDN A sequences"). The DN A portion encodinjg 
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PDZ domains was identified by alignment of cDNA and protein sequence using CLUSTALW. 
Based on the DNA/protein alignment information, primers encompassing the PDZ domains 
were designed. The expression of certain PDZ-containing proteins in cells was detected by 
polymerase chain reaction ("PGR") amplification of cDNAs obtained by reverse transcription 
5 ("RT") of cell-derivedRNA (i.e., "RT-PCR"). PCR, RT-PCR and other methods for analysis 
and manipulation of nucleic acids are well known and are described generally in Sambrook et 
al., (1989) Molecular Cloning: A Laboratory Manual, 2nd Ed., Vols. 1-3, Cold Spring 
Harbor Laboratory hereinafter, "Sambrook"); and Ausubel et al.. Current Protocols in 
Molecular Biology, Greene Publishing and Wiley-Interscience, New York (1997), as 

10 supplemented through January 1999 (hereinafter "Ausubel"). 

Samples of cDNA for those sequences identified through the foregoing search 
were obtained and then amplified. In general a sample of the cDNA (typically, 1/5 of a 20 |il 
reaction) was used to conduct PCR. PCR was conducted using primers designed to amplify 
specifically PDZ domain-containing regions of PDZ proteins of interest. Oligonucleotide 

15 primers were designed to amplify one or more PDZ-encoding domains. The DNA sequences 
encoding the various PDZ domains of interest were identified by inspection (i.e., conceptual 
translation of the PDZ protein cDNA sequences obtained from GenBank, followed by 
alignment with the PDZ domain amino acid sequence). TABLE 6 shows the PDZ-encoded 
domains amplified, and the GenBank accession number of the PDZ-domain containing 

20 proteins. To facilitate subsequent cloning of PDZ domains, the PCR primers included 
endonuclease restriction sequences at their ends to allow ligation with pGEX-3X cloning vector 
(Pharmacia, GenBank XXII 3 852 ) in frame with glutathione-S transferase (GST). 

TABLE 6 lists the proteins isolated for use in the aforementioned assays. 

25 VI. Assays for Detection of Interactions Between PDZ-Domain Polypeptides and . 
Candidate PDZ Ligand proteins (PL proteins^ 

Two complementary assays, termed "A* and "G,"" were developed to detect 
binding between a PDZ-domain polypeptide and candidate PDZ ligand. In each of the two 
different assays, binding is detected between a peptide having a sequence corresponding to the 
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C-terminus of a protein anticipated to bind to one or more PDZ domains (i.e. a candidate PL 
peptide) and a PDZ-domainpolypeptide (typically a flision protein containing a PDZ domain). 
In the "A" assay, the csuididate PL peptide is immobilized and binding of a soluble PDZ- 
domain polypeptide to the immobilized peptide is detected (the "A"' assay is named for the 
fact that in one embodiment an avidin surface is used to immobilize the peptide). In the "G" 
assay, the PDZ-domain polypeptide is immobilized and binding of a soluble PL peptide is 
detected (The "G" assay is named for the fact that in one embodiment a GST-binding surface 
is used to immobilize the PDZ-domainpolypeptide). Preferred embodiments of these assays 
are described in detail infra. However, it will be appreciated by ordinarily skilled practitioners 
that these assays can be modified in numerous ways while remaining useful for the purposes 
of the present invention. 

A. Production of Fusion Proteins Containing PDZ-Domains 

GST-PDZ domain fusion proteins were prepared for use in the assays of the 
invention. PGR products containing PDZ encoding domains (as described supra) were 
subcloned into an expression vector to permit expression of fusion proteins containing a PDZ 
domain and a heterologous domain (i.e., a glutathione-S transferase sequence, "GST"). PGR 
products (i.e., DNA fragments) representing PDZ domain encoding DNA was extracted from 
agarose gels using the "sephaglas" gel extraction system (Pharmacia) according to the 
manufacturer's recommendations. 

As noted supra ^ PGR primers were designed to include endonuclease restriction 
sites to facilitate ligation of PGR fragments into a GST gene fusion vector (pGEX-3X; 
Pharmacia, GenBank accession no. XXU13852) in-frame with the glutathione-S transferase 
coding sequence. This vector contains a IPTG inducible lacZ promoter. ThepGEX-3X vector 
was linearized using Bam HI and Eco RI or, in some cases, Eco RI or Sma I, and 
dephosphorylated. For most cloning approaches, double digestion with Bam HI emd Eco RI 
was performed, so that the ends of the PGR fragments to clone were Bam HI and Eco RI. In 
some cases, restriction endonuclease combinations used were Bgl II and Eco RI, Bam HI and 
Mfe I, or Eco RI only, Sma I only, or BamHI only. When more than one PDZ domain was 
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cloned, the DNA portion cloned represents the PDZ domains and the cDNA portion located 
between individual domains. Precise locations of cloned fragments used in the assays are 
indicated in TABLE 6. DNA linker sequences between the GST portion and the PDZ domain 
containing DNA portion vary slightly, dependent on which of the above described cloning sites 
and approaches were used. As a consequence, the amino acid sequence of the GST-PDZ fusion 
protein varies in the linker region between GST and PDZ domain. Protein linkers sequences 
corresponding to different clomng sites/approachesare shown below. Linker sequences (vector 
DNA encoded) are bold, PDZ domain containing gene derived sequences are in italics. 

1) GST — BamHI/BamHI— PDZ domain insert 
Gly— He — PDZ domain insert 

2) GST— BamHI/5^///— PZ)Z domain insert 
Gly — ^Ile — PDZ domain insert 

3) GST— "EcoRllEcoI— PDZ domain insert 

Gly — He — Pro— Gly~Asn — PDZ domain insert 

4) GST'-Smal/Smal— PDZ domain insert 
Gly — ^Ile — Pro — PDZ domain insert 

The PDZ-encoding PGR fragment and linearized pGEX-3X vector were ethanol 
precipitated and resuspended in 1 0 ul standard ligation buffer. Ligation was performed for 4-10 
hours at l^'C using T4 DNA ligase. It will be understood that some of the resulting constructs 
include very short linker sequences and that, when multiple PDZ domains were cloned, the 
constructs included some DNA located between individual PDZ domains. 

The ligation products were transformed in DH5a or BL-21 Rcoli bacteria 
strains. Colonies were screened for presence and identity of the cloned PDZ domain containing 
DNA as well as for correct fusion with the glutathione S-transferase encoding DNA portion by 
PGR and by sequence analysis. Positive clones were tested in a small sceile assay for 
expression of the GST/PDZ domain fusion protein and, if expressing, these clones were 
subsequently grown up for large scale preparations of GST/PDZ fusion protein. 

GST-PDZ domain fusion protein was overexpressed following addition of 
IPTG to the culture medium and purified. Detailed procedure of small scale and large scale 
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fusion protein expression and purification are described in "GST Gene Fusion System" (second 
edition, revision 2; published by Pharmacia). In brief, a small culture (3-5mIs) containing a 
bacterial strain (DH5a, BL21 or JM109) with the fusion protein construct was grown 
overnight in LB-media at 37°C with the appropriate antibiotic selection (lOOug/ml ampicillin; 
5 a.k.a. LB-amp). The overnight culture was poured into a fresh preparation of LB-amp 
(typically 250-500mls) and grown until the optical density (OD) of the culture was between 
0.5 and 0.9 (approximately 2.5 hours). IPTG (isopropyl p-D-thiogalactopyranoside) was 
added to a final concentration of 1 .OmM to induce production of GST fusion protein, and 
culture was grown an additional 1.5-2.5 hours. Bacteria were collect by ceiitrifugation (4500 

10 g) and resuspended in Buffer A- (50mMTris,pH 8.0, 50mM dextrose, ImMEDTA, 200uM 
phenylmethylsulfonylfluoride). An equal volume of Buffer A+ (Buffer A-, 4mg/ml lysozyme) 
was added and incubated on ice for 3 min to lyse bacteria. An equal volume of Buffer B 
(lOmM Tris, pH 8.0, 50mM KCl, ImM EDTA. 0.5% Tween-20, 0.5%NP40 (a.k.a. IGEPAL 
CA-630), 200uM phenylmethylsulfonylfluoride)was added and incubated for an additional 20 

15 min. The bacterial cell lysate was centrifuged (x20,000g), and supernatant was added to 
glutathione Sepharose 4B (Pharmacia, cat no. 17-0765-01) previously swelled (rehydrated) in 
IX phosphate-buffered saline (PBS). The supematant-Sepharose slurry was poured into a 
column and washed with at least 20 bed volumes of IX PBS. GST fusion protein was eluted 
off the glutathione sepharose by applying 0.5-1.0 ml aliquots of 5mM glutathione and 

20 collected as separate fractions. Concentrations of fractions were determined using BioRad 
. Protein Assay (cat no. 500-0006) according to manufacturer's specifications. Those fractions 
containing the highest concentration of fusion protein were pooled and dialyzed against IX 
PBS/35% glycerol. Fusion proteins were assayed for size and quality by SDS gel 
electrophoresis (PAGE) as described in "Sambrook." Fusion protein aliquots were stored at 

25 minus 80''C and at minus 20''C. 

B. Identification of Candidate PL Proteins and Svnthesis of Peptides 

Certain PDZ domains are bound by the C-terminal residues of PDZ-binding 
proteins. To identify PL proteins the C-terminal residues of sequences were visually inspected 
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for sequences that one might predict would bind to PDZ-domain containing proteins (see, e.g., 
Doyle et al., 1996, Cell 85, 1067; Songyanget al., 1997, Science 275, 73). TABLE 3 lists these 
proteins, and provides corresponding C-terminal sequences and GenBank accession numbers. 

Synthetic peptides of defined sequence (e.g., corresponding to the carboxyl- 
5 termini of the indicated proteins) can be synthesized by any standard resin-based method (see, 
e.g., U. S. Pat. No. 4,108,846; see also, Caruthers et al., 1980, Nucleic Acids Res. Symp, Ser., 
215-223; Horn et al., 1980, Nucleic Acids Res. Symp. Sen, 225-232; Roberge, et al., 1995, 
Science 269:202). The peptides used in the assays described herein were prepared by the 
FMOC (see, e.g., Guy and Fields, 1997, Meth. Enz. 289:67-83; Wellings and Atherton, 1997, 

10 Meth. £'«z.289:44-67). In some cases (e.g., for use in the A and G assays of the invention), 
peptides were labeled with biotin at the amino-terminus by reaction with a four- fold excess of 
biotin methyl ester in dimethylsulfoxide with a catalytic amoxmt of base. The peptides were 
cleaved from the resin using a halide containing acid (e.g. trifluoroacetic acid) in the presence of 
appropriate antioxidants (e.g. ethanedithiol) and excess solvent lyophilized. 

15 ' Following lyophilization, peptides can be redissolved and purified by reverse 

phase high performance liquid chromatography (HPLC). One appropriate HPLC solvent 
system involves a Vydac C-18 semi-preparative coluxrm running at 5 mL per minute with 
increasing quantities of acetonitrileplus 0. 1 % trifluoroacetic acid in a base solventof water plus 
0. 1% trifluoroacetic acid. After HPLC purification, the identities of the peptides are confirmed 

20 by MALDI cation-mode mass spectrometry. As noted, exemplary biotinylated peptides are 
provided in TABLE 3. 

C. Detecting PDZ-PL Interactions 

The present inventors were able in part to identify the interactions summarized 
25 in TABLE 2 by developing new high throughput screening assays which are described in greater 
detail infra. Various other assay formats known in the art can be used to select ligands that are 
specifically reactive with a particular protein. For example, solid-phase ELISA immunoassays, 
immunoprecipitation, Biacore, and Western blot assays can be used to identify peptides that 
specifically bind PDZ-domain polypeptides. As discussed supra, two different, 
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complementary assays were developed to detect PDZ-PL interactions. In each, one binding 
partner of a PDZ-PL pair is immobilized, and the ability of the second binding partner to bind 
is determined. These assays, which are described infra^ can be readily used to screen for 
hundreds to thousand of potential PDZ-ligand interactions in a few hours. Thus these assays 
5 can be used to identify yet more novel PDZ-PL interactions in hematopoietic cells. In addition, 
they can be used to identify antagonists of PDZ-PL interactions (see infra). 

In various embodiments, fusionprotein are used in the assays and devices of the 
invention. Methods for constructing and expressing fusion proteins are well known. Fusion 
protems generally are described in Ausubel et al., supra, KroU et al., 1993, DNA Cell. Biol. 

10 1 2:44 1 , and Imai et aL, 1 997, Cell 91:521 -30. Usually, the fusion protein includes a domain to 
facilitate immobilization of the protein to a solid substrate ("an immobilization domain"). 
Often, the immobilization domain includes an epitope tag (i.e., a sequence recognized by a 
antibody, typically a monoclonal antibody) such as polyhistidine (Bush et al, 1991, J. Biol 
Chem 266:1381 1-14), SEAP (Berger et al, 1988, Gene 66:1-10), or Ml and M2 flag (see, e.g, 

15 U.S. Pat. Nos. 5,011,912; 4,851,341; 4,703,004; 4,782,137).' In an embodiment, the 
immobilization domain is a GST coding region. It will be recognized that, in addition to the 
PDZ-domain and the particular residues bound by an immobilized antibody, protein A, or 
otherwise contacted with the surface, the protein (e.g., fusion protein), will contain additional 
residues. In some embodiments these are residues naturally associated with the PDZ-domain 

20 (i.e., in a particular PDZ-protein) biit they may include residues of synthetic (e.g., 
poly(alanine)) or heterologous origin (e.g., spacers of, e.g., between 10 and 300 residues). 

PDZ domain-containingpolypeptide used in the methodsof the invention(e.g., 
PDZ fusion proteins) of the invention are typically made by (1) constructing a vector (e.g., 
plasmid, phage or phagemid) comprising a polynucleotide sequence encoding the desired 

25 polypeptide, (2) introducing the vector into ah suitable expression system (e.g., a prokaryotic, 
insect, mammalian, or cell free expression system), (3) expressing the fusion protein and (4) 
optionally purifying the fusion protein. 

(1) In one embodiment, expression of the protein comprises insertuig the 
coding sequence into an appropriate expression vector (i.e., a vector that contains the necessary 
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elements for the transcription and translation of the inserted coding sequence required for the 
expression system employed, e.g., control elements including enhancers, promoters, 
transcription terminators, origins of replication, a suitable initiation codon (e.g., methionine), 
open reading frame, and translational regulatory signals (e.g., a ribosome binding site, a 
5 termination codon and a polyadeny lation sequence. Depending on the vector system and host 
utilized, any number of suitable transcription and translation elements, including constitutive 
and inducible promoters, can be used: 

The coding sequence of the fusion protein includes a PDZ domain and an 
immobilization domain as described elsewhere herein. Poly nucleotidesencoding the amino acid 

10 sequence for each domain can be obtained in a variety of ways known in the art; typically the 
polynucleotides are obtained by PCR amplification of cloned plasmids, cDNA libraries, and 
cDNA generated by reverse transcription of RN A, using primers designed based on sequences 
determined by the practitioner or, more often, publicly available (e.g., through GenBank). The 
primers include linker regions (e.g., sequences including restriction sites) to facilitate cloning and 

1 5 manipulation in production of the fusion construct. The polynucleotides corresponding to the 
PpZ and immobilization regions are joined in-frame to produce the fusion protein-encoding 
sequence. 

The fusion proteins of the invention may be expressed as secreted proteins 
(e.g., by including the signal sequence encoding DNA in the fusion gene; see, e.g., Lui et al, 
20 1993, PNAS USA, 90:8957-61) or as nonsecreted proteins. 

In some embodiments, the PDZ-containing proteins are immobilized on a solid 
surface. The substrate to which the polypeptide is bound may in any of a variety of forms, e.g., 
a microtiter dish, a test tube, a dipstick, a microcentrifuge tube, a bead, a spinnable disk, and 
the like. Suitable materials include glass, plastic (e.g., polyethylene, PVC, polypropylene, 
25 polystyrene, and the like), protein, paper, carbohydrate, lipip monolayer or supported lipid 
bilayer, and other solid supports. Other materials that may be employed include ceramics, 
metals, metalloids, semiconductive materials, cements and the like. 

In some embodiments, the ftision proteins are organized as an array. The term 
"array," as used herein, refers to ^n ordered arrangement of immobilized fusion proteins, in 
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which particular different fusion proteins (i.e., having different PDZ domains) are located at 
different predetermined sites on the substrate. Because the location of particular fusion 
proteins on the array is known, binding at that location can be correlated with binding to the 
PDZ domain situated at that location. Immobilization of fusion proteins on beads (individually 
5 or in groups) is another particularly useful approach. In one embodiment, individual fusion 
proteins are immobilized on beads. In one embodiment, mixtures of distinguishable beads are 
used. Distinguishable beads are beads that can be separated from each other on the basis of a 
property such as size, magnetic property, color (e.g., using FACS) or affinity tag (e.g., a bead 
coated with protein A can be separated from a bead not coated with protein A by using IgG 

10 affinity methods). Binding to particular PDZ domain may be determined; similarly, the effect 
of test compounds (i.e., agonists and antagonists of binding) may be determined. 

Methods for immobilizing proteins are known, and include covalent and non- 
covalent methods. One suitable immobilization method is antibody -mediated immobilization. 
According to this method, an antibody specific for the sequence of an "immobilization domain" 

15 of the PDZ-domain containing protein is itself immobilized on the substrate (e.g., by 
adsorption). One advantage of this approach is that a single antibody may be adhered to the 
substrate and used for immobilization of a nuniber of polypeptides (sharing the same 
inunobilization domain). For example, an immobilization domain consisting of poly-histidine 
(Bush et al, 1991, J. Biol Chem 266:1381 1-14) can be bound by an anti-histidine monoclonal 

20 antibody (R&D Systems, Minneapolis, MN); an immobilization domain consisting of secreted 
alkaline phosphatase ("SEAP") (Bergeretal, 1988, Gene 66:l-10)can be bound by anti-SEAP 
(Sigma Chemical Company, St. Louis, MO); an immobilization domain consisting of a FLAG 
epitope can be bound by anti-FLAG. Other ligand-antiligand immobilization methods are also 
suitable (e.g., an immobilization domain consisting of protein A sequences (Harlow and Lane, 

25 .1988, Antibodies A Laboratory Manual, Cold Spring Harbor Laboratory; Sigma Chemical Co., 
St. Louis, MO) can be bound by IgG; and an immobilization domain consisting of streptavidin 
can be bound by biotin (Harlow & Lane, supra; Sigma Chemical Co., St. Louis, MO). In a 
preferred embodiment, the immobilization domain is a GST moiety, as described herein. 

50 



When antibody-mediatedimmobilization methods are used, glass and plastic are 
especially useful substrates. The substrates may be printed with a hydrophobic (e.g.. Teflon) 
mask to form wells. Preprinted glass slides with 3, lOand 21 wells per 14.5 cm? slide "working 
area" are available from, e.g., SPI Supplies, West Chester, PA; also see U.S. Pat. No. 
5 4,01 1 ,350). In certain applications, a large format (12.4 cm x 8.3 cm) glass slide is printed in 
a 96 well format is used; this format facilitates the use of automated liquid handling equipment 
and utilization of 96 well format plate readers of various types (fluorescent, colorimetric, 
scintillation). However, higher densities may be used (e.g., more than 10 or 100 polypeptides 
per cm^). See, e.g., MacBeath et al, 2000, Science 289:1760-63. 

10 Typically, antibodies are bound to substrates (e.g., glass substrates) by 

adsorption. Suitable adsorption conditions are well known in the art and include incubation of 
0.5-50|ig/ml (e.g., 1 0 |Lig/ml) mAb in buffer (e.g., PBS, or 50 to 300 mM Tris, MOPS, HEPES, 
PIPES, acetate buffers, pHs 6.5 to 8, at 4^*0) to 37°C and from Ihr to more than 24 hours. 

Proteins may be covalently bound or noncovalently attached through 

15 nonspecific bonding. If covalent bonding between a the fusion protein and the surface is 
desired, the surface will usually be polyfunctional or be capable of being polyfunctionalized. 
Functional groups which may be present on the surface and used for linking can include 
carboxylic acids, aldehydes, amino groups, cyano groups, ethylenic groups, hydroxy 1 groups, 
mercapto groups and the like. The manner of linking a wide variety of compounds to various 

20 surfaces is well known and is amply illustrated in the literature. 

"A Assav" Detection of PDZ-Ligand Binding Using Immobilized PL Peptide. 

In one. aspect, the invention provides an assay in which biotinylated candidate 
PL peptides are immobilized on an avidin coated surface. The binding of PDZ-domain fusion 
25 protein to this surface is then measured. In a preferred embodiment, the PDZ-domain fusion 
protein is a GST/PDZ fusion protein and the assay is carried out as follows: 

(1) Avidin is bound to a surface, e.g. a protein binding surface. In one 
embodiment, avidin is bound to a polystyrene 96 well plate (e.g.. Nunc Poly sorb (cat #475094) 
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by addition of 100 |iL per well of 20 )ig/mL of avidin (Pierce) in phosphate buffered saHne 
without calcium and magnesium, pH 7.4 ("PBS", GibcoBRL) at for 12 hours. The plate 
is then treated to block nonspecific interactions by addition of 200 |iL per well of PBS 
containing 2 g per 1 00 mL protease-free bovine serum albumin ("PBS/BSA")for 2 hours at 4°C. 
5 The plate is then washed 3 times with PBS by repeatedly adding 200 |iL per well of PBS to 
each well of the, plate and then dumping the contents of the plate into a waste container and 
tapping the plate gently on a dry surface. 

(2) Biotinylated PL peptides (or candidate PL peptides, e.g. see TABLE 3) 
10 are immobiUzed on the surface of wells of the plate by addition of 50 |LiL per well of 0.4 |xM 

peptide in PBS/BSAfor 30 minutes at 4°C. Usually, each different peptide is added to at least 
eight different wells so that multiple measurements (e.g. duplicates and also measurements 
using different (3ST/PDZ-domain fusion proteins and a GST alone negative control) can be 
made, and also additional negative control wells are prepared in which no peptide is 
1 5 immobilized. Following immobilization of the PL peptide on the surface, the plate is washed 
3 times with PBS. 

(3) GST/PDZ-domain fusion protein (prepared as described supra) is 
allowed to react with the surface by addition of 50 |LlL per well of a solution containing 5 |Xg/mL 

20 GST/PDZ-domain fusion protein in PBS/BSA for 2 hours at 4''C. As a negative control, GST 
alone (i.e. not a fusion protein) is added to specified wells, generally at least 2 wells (i.e. 
duplicate measurements) for each immobilized peptide. After the 2 hour reaction, the plate is 
washed 3 times with PBS to remove unbound fusion protein. 

25 (4) The binding of the GST/PDZ-domain fusion protein to the avidin- 

biotinylated peptide surface can be detected using a variety of methods, and detectors known 
in the art. In one embodiment, 50 |LiL per well of an anti-GST antibody in PBS/BSA (e.g. 2.5 
\xg/mL of polyclonal goat-anti-GST antibody, Pierce) is added to the plate and allowed to react 
for 20 minutes at 4''C. The plate is washed 3 times with PBS and a second, detectably labeled 



antibody is added. In one embodiment, 50 |iL per well of 2.5 |Lig/mL of horseradish peroxidase 
(HRP)-conjugated polyclonal rabbit anti-goat immunoglobulin antibody is added to the plate 
and allowed to react for 20 minutes at 4°C. The plate is washed 5 times with 50 mM Tris pH 
8.0 containing 0.2% Tween 20, and developed by addition of 100 jiL per well of HRP-substrate 
5 solution (TMB, Dako) for 20 minutes at room temperature (RT). The reaction of the HRP and 
its substrate is terminated by the addition of 100 |iL per well of IM sulfuric acid and the 
optical density (O.D.) of each well of the plate is read at 450 nm. 

(5) Specific binding of a PL peptide and a PDZ-domain polypeptide is 

10 detected by comparing the signal from the well(s) in which the PL peptide and PDZ domain 
polypeptide are combined with the background signal(s). The background signal is the signal 
found in the negative controls. Typically a specific or selective reaction will be at least twice 
background signal, more typically more than 5 times background, and most typically 10 or 
more times the background signal. In addition, a statistically significant reaction will involve 

1 5 multiple measurements of the reaction with the signal and the background differing by at least 
two standard errors, more typically four standard errors, and most typically six or more 
standard errors. Correspondingly, a statistical test (e.g. a T-test) comparing repeated 
measurements of the signal with repeated measurements of the background will result in a p- 
value < 0.05, more typically a p-value < 0.01, and most typically a p-value < 0.001 or less. 

20 As noted, in an embodiment of the "A" assay, the signal from binding of a 

GST/PDZ-domain fusion protein to an avidin surface not exposed to (i.e. not covered with) the 
PL peptide is one suitable negative control (sometimes referred to as "B"). The signal from 
binding of GST polypeptide alone (i.e. not a fusion protein) to an avidin-coated surface that 
has been exposed to (i.e. covered with) the PL peptide is a second suitable negative control 

25 (sometimes referred to as "B2"). Because all measurements are done in multiples (i.e. at least 
duplicate) the arithmetic mean (or, equivalently, average) of several measurements is used in 
determining the binding, and the standard error of the mean is used in determining the probable 
error in the measurement of the binding. The standard error of the mean of N measurements 
equals the square root of the following: the sum of the squares of the difference between each 
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measurement and the mean, divided by the product of (N) and (N-1). Thus, in one 
embodiment, specific binding of the PDZ protein to the plate-bound PL peptide is determined 
by comparing the mean signal ("mean S") and standard error of the signal ("SE") for a particular 
PL-PDZ combination with the mean Bl and/or mean B2. 
5 ' ' 

" G Assay" - Detection of PDZ-Ligand Binding Using Immobilized PDZ- 
Domain Fusion Polypeptide 

In one aspect, the invention provides an assay in which a GST/PDZ fusion 
protein is immobilized on a surface ("G" assay). The binding of labeled PL peptide (e.g., as . 
10 listed in TABLE 2) to this surface is then measured. In a preferred embodiment, the assay is 
carried out as follows: . 

(1) A PDZ-domain polypeptide is bound to a surface, e.g. a protein binding . 
surface. In a preferred embodiment, a GST/PDZ fusion protein containing one or more PDZ 

1 5 domains is bound to a polystyrene 96- well plate. The GST/PDZ fusion protein can be biDund 
to the plate by any of a variety of standard methods known to one of skill in the art, although 
some care must be taken that the process of binding the fusion protein to the plate does not 
alter the ligand-binding properties of the PDZ domain. In one embodiment, the GST/PDZ 
fusion protein is bound via an anti-GST antibody that is coated onto the 96-well plate. 

20 Adequate binding to the plate can be achieved when: 

a. 1 00 |LiL per well of 5 |-ig/mL goat anti-GST polyclonal antibody 
(Pierce) in PBS is added to a polystyrene 96-well plate (e.g., Nunc Polysorb) at 4*^0 for 12 
hours. 

b. The plate is blocked by addition of 200 |iL per well of PBS/BS A 

25 for 2 hours at 4^C. 

c. The plate is washed 3 times with PBS. 

d. 50 (xL per well of 5 ^ig/mL GST/PDZ fusion protein) or, as a 
negative control, GST polypeptide alone (i.e. not a fusion protein) in PBS/BS A is added to the 
plate for 2 hours at 4°C. 



e. the plate is again washed 3 times with PBS. 



(2) Biotinylated PL peptides are allowed to react with the surface by 
addition of 50 \iL per well of 20 \iM solution of the biotinylated peptide in PBS/BSA for 10 

5 minutes at 4''C, followed by an additional 20 minute incubation at 25°C. The plate is washed 
3 times with ice cold PBS. 

(3) . The binding of the biotinylated peptide to the GST/PDZ fusion protein 
surface can be detected using a variety of methods and detectors known to one of skill in the 

10 art. In one embodiment, 100 |iL per well of 0.5 |ig/mL streptavidin-horse radish peroxidase 
(HRP) conjugate dissolved in BSA/PBS is added and allowed to react for 20 minutes at 4°C. 
The plate is then washed 5 times with 50 mM Tris pH 8.0 contaming 0.2% Tween 20, and 
developed by addition of 100 ^iL per well of HRP-substrate solution (TMB, Dako) for 20 
minutes at room temperature (RT). The reaction of the HRP and its substrate is terminated by 

15 addition of 100 |xL per well of 1 M sulfuric acid, and the optical density (O.D.) of each well 
of the plate is read at 450 um. 

(4) Specific binding of a PL peptide and a PDZ domain polypeptide is 
determined by comparing the signal from the well(s) in which the PL peptide and PDZ domain 

20 polypeptide are combined, with the background signal(s). The background signal is the signal 
found in the negative control(s). Typically a specific or selective reaction will be at least twice 
background signal, more typically more than 5 times background, and most typically 10 or 
more times the background signal. In addition, a statistically significant reaction will involve 
multiple measurements of the reaction with the signal and the background differing by at least 

25 two standard errors, more typically four standard errors, and most typically six or more 
standard errors. Correspondingly, a statistical test (e.g. a' T-test) comparing repeated 
measurements of the signal with -repeated measurements of the background will result in a p- 
value < 0.05, more typically a p-value < 0.01, and most typically a p-value < 0.001 or less. 
As noted, in an embodiment of the "G" assay, the signal from binding of a given PL peptide 
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to immobilized (surface bound) GST polypeptide alone is one suitable negative control 
(sometimes referred to as "B 1 "). Because all measurement are done in multiples (i.e. at least 
duplicate) the arithmetic mean (or, equivalently, average.) of several measurements is used in 
determining the binding, and the standard error of the mean is used in determining the probable 
5 error in the measurement of the binding. The standard error of the.mean of N measurements 
equals the square root of the following: the sum of the squares of the difference between each 
measurement and the mean, divided, by the product of (N) and (N-1). Thus, in one 
embodiment, specific binding of the PDZ protein to the platebound peptide is determined by 
comparing the mean signal ("mean S") and standard error of the signal ("SE") for a particular 
1 0 PL-PDZ combination with the mean B 1 . 

"G^ assay" and "G^^ assay" 

Two specific modifications of the specific conditions described supra for the 
"G assay" are particularly useful. The modified assays use lesser quantities of labeled PL 
peptide and have slightly different biochemical requirements for detection of PDZ-ligand 

1 5 binding compared to the specific assay conditions described supra. 

> For convenience, the assay conditions described in this section are referred to 

as the "G' assay" and the "G" assay," with the specific conditions described in the preceding 
section on G assays being referred to as the "G^ assay." The "G' assay" is identical to the "G^ 
assay" except at step (2) the peptide concentration is 10 uM instead of 20 uM. This results 

20 in slightly lower sensitivity for detection of interactions with low affinity and/or rapid 
dissociation rate. Correspondingly, it slightly increases the certainty that detected interactions 
are of sufiBcient affinity and half-life to be of biological importance and useful therapeutic 
targets. 

The "G" assay" is identical to the "G*' assay" except that at step (2) the 
25 peptide concentration is 1 |JiM instead of 20 |LlM and the incubation is performed for 60 
minutes at 25''C (rather than, e.g., 10 minutes at 4°C followed by 20 minutes at 25°C). This 
results in lower sensitivity for interactions of low affinity, rapid dissociation rate, and/or 
affinity that is less at 25*^0 than at 4°C. Interactions will have lower affinity at 25°C than at 
4*'C if (as we have found to be generally true for PDZ-ligand binding) the reaction entropy is 
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negative (i.e. the entropy of the products is less than the entropy of the reactants). In contrast, 
the PDZ-PL binding signal may be similar in the "G" assay" and the "G® assay" for 
interactions of slow association and dissociation rate, as the PDZ-PL complex will accumulate 
during the longer incubation of the "G" assay." Thus comparison of results of the "G' ' assay" 
and the "G^ assay" can be used to estimate the relative entropies, enthalpies, and kinetics of 
different PDZ-PL interactions. (Entropies and enthalpies are related to binding affinity by the 
equations delta G = RT In (Kd) = delta H - T delta S where delta G, H, and S are the reaction 
free energy, enthalpy, and entropy respectively, T is the temperature in degrees Kelvin, R is 
the gas constant, and Kd is the equilibrium dissociation constant). In particular, interactions 
that are detected only or much more strongly in the "G^ assay" generally have a rapid 
dissociation rate at 25°C (tl/2 < 10 minutes) and a negative reaction entropy, while 
interactions that are detected similarly strongly in the "G" assay" generally have a slower 
dissociation rate at 25°C (tl/2 > 10 minutes). Rough estimation of the thermodynamics and 
kinetics of PDZ-PL interactions (as can be achieved via comparison of results of the "G^ assay" 
versus the "G' ' assay" as outlined supra) can be used in the design of efficient inhibitors of the 
interactions. For example, a small molecule inhibitor based on the chemical structure of a PL 
that dissociates slowly from a given PDZ domain (as evidenced by similar binding in the "G" 
assay" as in the "G^ assay") may itself dissociate slowly and thus be of high affinity. 

In this maimer, variation of the temperature and duration of step (2) of the "G 
assay" can be used to provide insight into the kinetics and thermodynamics of the PDZ-ligand 
binding reaction and into design of inhibitors of the reaction. 

Assay Variations , 

As discussed supra, it will be appreciated that many of the steps in the above- 
described assays can be varied, for example, various substrates can be used for binding the PL 
and PDZ-containing proteins; different types of PDZ containing fusion proteins can be used; 
different labels for detecting PDZ/PL interactions can be employed; and different ways of 
detection can be used. 

The PL protein used in the assay is not intended to be limited to a 20 amino 
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acid peptide. Full lengthor partial protein may be used, either alone or as a fusionprotein. For 
example, a GST-PL protein fusion may be bound to the anti-GST antibody, with PDZ protein 
added to the bound PL protein or peptide. 

The PDZ-PL detection assays can employ a viariety of surfaces to bind the PL 
and PDZ-containing proteins. For example, a surface can be an "assay plate" which is formed 
from a material (e.g. polystyrene) which optimizes adherence of either the PL protein or PDZ- 
containing protein thereto. Generally, the individual wells of the assay plate will have a high 
surface area to volume ratio and therefore a suitable shape is a flat bottom well (where the 
proteins of the assays are adherent). Other surfaces include, but are not limited to, polystyrene 
or glass beads, polystyrene or glass slides, and the like. 

For example, the assay plate can be a "microtiter" plate. The term "microtiter" 
plate when used herein refers to a multiwell assay plate, e.g., having between about 30 to 200 
individual wells, usually 96 wells. Altematively, high density arrays can be used. Often, the 
individual wells of the microtiter plate will hold a maximum volume of about 250 ul. 
Conveniently, the assay plate is a 96 well polystyrene plate (such as that sold by Becton 
Dickinson Labware, Lincoln Park, N.J.), which allows for automation and high throughput 
screening. Other surfaces include polystyrene microtiter ELISA plates such as that sold by 
Nunc Maxisorp, Inter Med, Denmark. Often, about 50 ul to 300 ul, more preferably 100 ul to 
200 ul, of an aqueous sample comprising buffers suspended therein will be added to each well 
of the assay plate. 

The detectable labels of the invention can be any detectable compound or 
composition which is conjugated directly or indirectly with a molecule (such as described 
above). The label can be detectable by itself (e.g., radioisotope labels or fluorescent labels) or, 
in the case of an enzymatic label, can catalyze a chemical alteration of a substrate compound 
or composition which is detectable. The preferred label is an enzymatic one which catalyzes 
a color change of a non-radioactive color reagent. 

Sometimes, the label is indirectly conjugated with the antibody. One of skill is 
aware of various techniques for indirect conjugation. For example, the antibody can be 
conjugated with biotin and any of the categories of labels mentioned above can be conjugated 
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with avidin, or vice versa (see also "A" and "G" assay above). Biotin binds selectively to 
avidin and thus, the label can be conjugated with the antibody in this indirect manner. See, 
Ausubel, supra^ for a review of techniques involving biotin-avidin conjugation and similar, 
assays. Alternatively, to achieve indirect conjugation of the label with the antibody, the 
5 antibody is conjugated with a small hapten (e.g. digoxin)and one of the different types of labels 
mentioned above is conjugated with an anti-hap ten antibody (e.g. anti-digoxin antibody). Thus, 
indirect conjugation of the label with the antibody can be achieved. 

Assay variations can include different washing steps. By "washing" is meant 
exposing the solid phase to an aqueous solution (usually a buffer or cell culture media) in such 

10 a way that unbound material (e.g., non-adhering cells, non-adhering capture agent, unbound 
ligand, receptor, receptor construct, cell lysate, or HRP antibody) is removed therefrom. To 
reduce background noise, it is convenient to include a detergent (e.g., Triton X) in the washing 
solution. Usually, the aqueous washing solution is decanted from the wells of the assay plate 
following washing. Conveniently, washing can be achieved usingW automated washing device. 

1 5 Sometimes, several washing steps (e.g., between about 1 to 1 0 washing steps) can be required. 

Various buffers can also be used in PDZ-PL detection assays. For example, 
various blocking buffers can be used to reduce assay background. The term "blocking buffer" 
refers to ah aqueous, pH buffered solution containing at least one blocking compound which 
is able to bind to exposed surfaces of the substrate which are not coated with a PL or PDZ- 

20 containing protein. The blocking compound is normally a protein such as bovine serum 
albumin (BSA), gelatin, casein or milk powder and does not cross-react with any of the reagents 
in the assay. The block buffer is generally provided at a pH between about 7 to 7.5 and 
suitable buffering agents include phosphate and TRIS. 

Various enzyme-substrate combinations can also be utilized in detecting PDZ- 

25 PL interactions. Examples of enzyme-substrate combinations include, for example: 

(i) Horseradish peroxidase (HRPO) with hydrogen peroxidase as a substrate, 
wherein the hydrogen peroxidase oxidizes a dye precursor (e.g. orthopheny lene diamine [OPD] 
or 3,3',5,5'-tetramethyl benzidine hydrochloride [TMB]) (as described above). 

(ii) alkaline phosphatase (AP) with para-Nitrophenyl phosphate as 
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chromogenic substrate. 

(iii) fl-D-galactosidase (B D-Gal) with a chromogenic substrate (e.g. p- 
nitrophenyl- B-D-galactosidase) or fluorogenic substrate 4-methylumbelliferyl- B-D- 
galactosidase. 

5 Numerous other enzyme-substrate combinations are available to those skilled 

in the art. For a general review of these, see U.S. Pat. Nos. 4,275,149 and 4,318,980, both of 
which are herein incorporated by reference. 

Further, it will be appreciated, that, although, for convenience, the present 
discussion primarily refers antagonists of PDZ-PL interactions, agonists of PDZ-PL 
1 0 interactions can be identified using the methods disclosed herein or readily apparent variations 
thereof. 

VII. Results of PDZ-PL Interaction Assays 

TABLE 2, supra, shows the results of assays in which specific binding was 
15 detected using the "G"' assay described herein. 

VIIL Measurement of PDZ-Ligand Binding Affinity 

The "A" and "G" assays of the invention can be used to determine the 
"apparent affinity" of binding of a PDZ ligand peptide to a PDZ-domain polypeptide. 
20 Apparent affinity is determined based on the concentration of one molecule required to saturate 
the binding of a second molecule (e.g., the binding of a ligand to a receptor). Two particularly 
useful approaches for quantitation of apparent affinity of PDZ-ligand binding are provided 
infra. 

(1) A GST/PDZ fusion protein, as well as GST alone as a negative control, are 
25 bound to a surface (e.g., a 96- well plate) and the surface blockedand washed as described supra 

for the "G" assay. 

(2) 50 jiL per well of a solution of biotinylated PL peptide (e.g. as shown in 
TABLE 3) is added to the surface in increasing concentrationsm PBS/BSA (e.g. at 0. 1 |iM, 0.33 
p,M, 1 fxM, 3.3 |LiM, 10 |JiM, 33 ^iM, and 100 |iM). In one embodiment, the PL peptide is 
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allowed to react with the bound GST/PDZ fusion protein (as well as the GST alone negative 
control) for 10 minutes at 4^C followed by 20 minutes at 25°C. The plate is washed 3 times 
with ice cold PBS to remove unbound labeled peptide. 

(3) The binding of the PL peptide to the immobilized PDZ-domain polypeptide is 
5 detected as described supra for the "G" assay. 

(4) For each concentration of peptide, the net binding signal is determined by 
subtracting the binding of the peptide to GST alone from the bmding of the peptide to the 
GST/PDZ fusion protein. The net binding signal is then plotted as a function of ligand 
concentration and the plot is fit (e.g. by using the Kaleidagraph software package curve fitting 

10 algorithm) to the following equation, where "Signal^iigand]" is the net binding signal at PL peptide 
concentration "[ligand]," ''Kd" is the apparent affinity of the binding event, and "Saturation 
Binding" is a constant determined by the curve fitting algorithm to optimize the fit to the 
experimental data: 

15 Signaljiigand] = Saturation Binding x ([ligand] / ([ligand] + Kd)) 

For reliable application of the above equation it is necessary that the highest 
peptide ligand concentration successfully tested experimentally be greater than, or at least 
similar to, the calculated Kd (equivalently, the maximum observed binding should be similar to 
the calculated saturation binding). Incases where satisfying the above criteria proves difficult, 
20 an alternative approach (infi-a) can be used. 

Approach 2: 

(1) A fixed concentration of a PDZ-domain polypeptide and increasing 
concentrations of a labeled PL peptide (labeled with, for example, biotin or fluorescein, see 
TABLE 2for representative peptide amino acid sequences) are mixed together in solution and 
25 allowed to react. In one embodiment, preferred peptide concentrations are 0.1 |iM, 1 |iM, 10 
|LiM, 100 |iM, 1 mM. In various embodiments, appropriate reaction times can range from 10 
minutes to 2 days at temperatures ranging fi-om 4°C to 37°C. In some embodiments, the 
identical reaction can also be carried out using a non-PDZ domain-containing protein as a 



control (e.g., if the PDZ-domain polypeptide is fusion protein, the fusion partner can be used). 

(2) PDZ-ligand complexes can be separated from unbound labeled peptide 
using a variety of methods known in the art. For example, the complexes can be separated using 
high performance size-exclusion chromatography (HPSEC, gel filtration) (Rabinowitz et al., 

5 1998, Immunity 9:699), affinity chromatography(e.g. using glutathione Sepharose beads), and 
affinity absorption (e.g., by binding to an anti-GST-coated plate as described supra). 

(3) The PDZ-ligand complex is detected based on presence of the label on 
the peptide ligandusing a variety of methods and detectors known to one of skill in the art. For 
example, if the label is fluorescein and the separation is achieved using HPSEC, an in-line 

10 fluorescence detector can be used. The binding can also be detected as described supra for the 
G assay. 

(4) The PDZ-ligand binding signal is plotted as a function of ligand 
concentration Euid the plot is fit. (e.g., by using the Kaleidagraph sofl:ware package curve 
fitting algorithm) to the following equation, where "Signal[iigand]" is the binding signal at PL 

15 peptide concentration "[ligand]," "Kd" is the apparent affinity of the binding 

event, and "Saturation Binding" is a constant determined by the curve fitting algorithm to 
optimize the fit tq the experimental data: 

Signal[Ligand] = Saturation Binding x ([ligand] / ([ligand + Kd]) 

20 

Measurement of the affinity of a labeled peptide ligand binding to a PDZ- 
domain polypeptide nis useful because knowledge of the affinity (or apparent affinity) of this 
interaction allows rational design of inhibitors of the interaction with known potency. The 
25 potency of inhibitors in inhibition would be similar to (i.e. within one-order of magnitude of) 
the apparent affinity of the labeled peptide ligand binding to the PDZ-domain. 

Thus, in one aspect, the invention provides a method of determining the 
apparent affinity of binding between a PDZ domain and a ligand by immobilizing a polypeptide 
comprising the PDZ domain and a non-PDZ domain on a surface, contacting the immobilized 
30 polypeptide with a plurality of different concentrations of the ligand, determining the amount 
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of binding of the ligandto the immobiHzedpolypeptide at each of the concentrations of Ugand, 
and calculating the apparent affinity of the binding based on that data. Typically, the 
polypeptide comprising the PDZ domain and a non-PDZ domain is a fusion protein. In one 
embodiment, the e.g., fusion protein is GST-PDZ fusion protein, but other polypeptides can 
5 also be used (e.g., a fusion protein includinga PDZ domain and any of a variety of epitope tags, 
biotinylation signals and the Hke) so long as the polypeptide can be immobilized In an 
orientation that does not abolish the ligand binding properties of the PDZ domain, e.g, by 
tethering the polypeptide to the surface via the non-PDZ domain via an anti-domain antibody 
and leaving the PDZ domain as the free end. It was discovered, for exainple, reacting a PDZ- 

10 GST fusion polypeptide directly to a plastic plate provided suboptimal results. The 
calculation of binding affinity itself can be determined using any suitable equation (e.g., as 
shown supra; also see Cantor and Schimmel (1980) Biophysical Chemistry WH Freeman & 
Co., San Francisco) or software. 

Thus, in apreferred embodiment,the polypeptide is immobilizedby binding the 

15 polypeptide to an immobilized immunoglobulin that binds the non-PDZ domain (e.g., an anti- 
GST antibody when a GST-PDZ fusion polypeptide is used). In a preferred embodiment, the 
step of contacting the ligand and PDZ-domain polypeptide is carried out under the conditions 
provided supra in the description of the "G" assay. It will be appreciated that binding assays 
are conveniently carried out in multiwell plates (e.g., 24- well, 96- well plates, or 384 well 

20 plates). 

The present method has considerable advantages over other methods for 
measuring binding affinities PDZ-PL affinities, which typically involve contacting varying 
concentrations of a GST-PDZ fusion protein to a ligand-coated surface. For example, some 
previously described methods for determining affinity (e.g., using immobilized ligand and GST- 
25 PDZ protein in solution) did not account for oligomerization state of the fusion proteins used, 
resulting in potential errors of more than an order of magnitude. 

Although not sufficient for quantitative measurement of PDZ-PL binding 
affinity, an estimate of the relative strength of binding of different PDZ-PL pairs can be made 
based on the absolute magnitude of the signals observed in the "G assay." This estimate will 



reflect several factors, including biologically relevant aspects of the interaction, including the 
affinity and the dissociation rate. For comparisons of different ligands binding to a given PDZ 
domain-containing protein, differences in absolute binding signal likely relate primarily to the 
affinity and/or dissociation rate of the interactions of interest. 

5 . 

IX. Assays to Identify Novel PDZ Domain Binding Moieties and to Identify Modulators 
of PDZ Protein-PL Protein Binding 

Although described 5 wpra primarily in terms of identifying interactions between 
PDZ-domain polypeptides and PL proteins, the assays described supra and other assays can 

1 0 also be used to identify the binding of other molecules (e.g., peptide mimetics, small molecules, 
and the like) to PDZ domain sequences. For example, using the assays disclosed herein, 
combinatorial and other libraries of compounds can be screened, e.g., for molecules that 
specifically bind to PDZ domains. Screening of libraries can be accomplished by any of a 
variety of commonly known methods. See, e.g., the following references, which disclose 

15 screening of peptide libraries: Parmley and Smith, 1989, Adv. Exp. Med. BioL 251:215-218; 
Scott and Smith, 1990, Science 249:386-390; Fowlkes et aL, 1992; BioTechniques 13:422-427; 
Oldenburg et al., 1992, Proc. Natl Acad Sci. USA 89:5393-5397; Yu et al., 1994, Cell 76:933- 
945; Staudt et aL, 1988, Science 241:577-580; Bock et al., 1992, Nature 355:564-566; Tuerk 
et aL, 1992, Proc. Natl. Acad. Sci. USA 89:6988-6992; Ellington et aL, 1992, Nature 355:850- 

20 852; U.S. Patent No. 5,096,815, U.S. Patent No. 5,223,409, and U.S. Patent No. 5,198,346, 
all to Ladner et al.; Rebar and Pabo, 1993, Science 263:671-673; and PCT Publication No. 
WO 94/18318. 

In a specific embodiment, screening can be carried out by contacting the library 
members with a PDZ-domain polypeptide immobilized on a solid support (e.g. as described 
25 supra in the "G" assay) and harvesting those library members that bind to the protein. 
Examples of such screening methods, termed "panning" techniques are described by way of 
example in Parmley and Smith, 1988, Gene 73:305-318; Fowlkes et aL, 1992, BioTechniques 
13:422-427; PCT Publication No. WO 94/18318; and in references cited hereinabove. 

In another embodiment, the two-hybrid system for selectmg interacting proteins 
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in yeast (Fields and Song, 1989, A^a/wr^ 340:245-246; ChienetaL, 1991, Proc. Natl, Acad ScL 
USA 88:9578-9582) can be used to identify molecules that specifically bind to aPDZ domain- 
containing protein. Furthemiore, the identified molecules are further tested for their ability to 
inhibit transmembrane receptor interactions with a PDZ domain. 

In one aspect of the invention, antagonists of an interaction between a PDZ 
protein and a PL protein are identified. In one embodiment, a modification of the "A" assay 
described supra is used to identify antagonists. In one embodiment, a modification of the "G" 
assay described supra is used to identify antagonists. 

In one embodiment, screening assays are used to detect molecules that 
specifically bind to PDZ domains. Such molecules are useful as agonists or antagonists of 
PDZ-protein-mediated cell function (e.g., cell activation, e.g., T cell activation, vesicle 
transport, cytokine release, grovs1;h factors, transcriptional changes, cytoskeleton 
rearrangement, cell movement, chemotaxis, and the like). In one embodiment, such assays are 
performed to screen for leukocyte activation inhibitors for drug development. The invention 
thus provides assays to detect molecules that specifically bind to PDZ domain-containing 
proteins. For example, recombinant ceils expressing PDZ domain-encoding nucleic acids can 
be used to produce PDZ domains in these assays and to screen for molecules that bind to the 
domains. Molecules are contacted with the PDZ domain (or fragment thereof) under conditions 
conducive to binding, and then molecules that specifically bind to such domains are identified. 
Methods that can be used to carry out the foregoing are commonly known in the art. 

It will be appreciated by the ordinarily skilled practitioner that, in one 
embodiment, antagonists are identified by conducting the A or G assays in the presence and 
absence of a known or candidate antagonist. When decreased binding is observed in the 
presence of a compound, that compoimd is identified as an antagonist Increased binding in the 
presence of a compound signifies that the compound is an agonist. 

For example, in one assay, a test compound can be identified as an inhibitor 
(antagonist) of binding between a PDZ protein and a PL protein by contacting a PDZ domain 
polypeptide and a PL peptide or protein in the presence and absence of the test compound, 
under conditions in which they would (but for the presence of the test compound) fomi a 
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complex, and detecting the formation of the complex in the presence and absence of the test 
compound. It will be appreciated that less complex formation in the presence of the test 
compound than in the absence of the compound indicates that the test compound is an inhibitor 
of a PDZ protein -PL protein binding. 
5 In one embodiment, the "G" assay is used in the presence or absence of an 

candidate inhibitor. In one embodiment, the "A" assay is used in the presence or absence of 
a candidate inhibitor. 

In one embodiment (in which a G assay is used), one or more PDZ domain- 
containing GST-fusion proteins are bound to the surface of wells of a 96- well plate as described 

10 supra (with appropriate controls including nonfusion GST protein). All fusion proteins are 
bound in multiple wells so that appropriate controls and statistical analysis can be done. A test 
compound in BSA/PBS (typically at multiple different concentrations) is added to wells. 
Immediately thereafter, 30 uL of a detectably labeled (e.g., biotinylated) PL peptide or protein 
known to bind to the relevant PDZ domain (see, e.g., TABLE 2) is added in each of the wells 

15 at a final concentration of, e.g., between about 2 )iM and about 40 |LiM, typically 5 |xM, 15 
(iM, or 25 |iM. This mixture is then allowed to react with the PDZ fusion protein bound to the 
surface for 10 minutes at 4^C followed by 20 minutes at 25°C. The surface is washed free of 
unbound PL polypeptide three times with ice cold PBS and the amount of binding of the 
polypeptide in the presence and absence of the test compound is determined. Usually, the 

20 level of binding is measured for each set of replica wells (e.g. duplicates) by subtracting the 
mean GST alone background from the mean of the raw measurement of polypeptide binding 
in these wells. 

In an alternative embodiment, the A assay is carried out in the presence or 
absence of a test candidate to identify inhibitors of PL-PDZ interactions. 
25 In one embodiment, a test compound is determined to be a specific inhibitor of 

the binding of the PDZ domain (P) and a PL (L) sequence when, at a test compound 
concentration of less than or equal to 1 mM (e.g., less than or equal to: 500 |iM, 100 ^jlM, 10 
|xM, 1 M,M, 100 nM or 1 nM) the binding of P to L in the presence of the test compound less 
than about 50% of the binding in the absence of the test compound, (in various embodiments, 
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less than about 25%, less than about 10%, or less than about 1%). Preferably, the net signal 
of binding of P to L in the presence of the test compound plus six (6) times the standard error 
of the signal in the presence of the test compound is less than the binding signal in the absence 
of the test compound. 

5 In one embodiment, assays for an inhibitor are carried out using a single PDZ 

protein-PL protein pair (e.g., a PDZ domain fusion protein and a PL peptide or protein). In 
a related embodiment, the assays are carried out using a plurality of pairs, such as a plurality 
of different pairs listed in TABLE 2. 

In some embodiments, it is desirable to identify compounds that, at a given 

10 concentration, inhibit the binding of one PL-PDZ pair, but do not inhibit (or inhibit to a lesser 
degree) the binding of a specified second PL-PDZ pair. These antagonists can be identified by 
carrying out a series of assays using a candidate inhibitor and different PL-PDZ pairs (e.g.j as 
shown in the matrix of TABLE 2) and comparing the results of the assays. All such pairwise 
combinations are contemplated by the invention (e.g., test compound inhibits binding of PL| 

15 to PDZi to a greater degree than it inhibits bmding of PLi to PDZ2 or PL2 to PDZ2). 
Importantly, it will be appreciated that, based on the data provided in TABLE 2 and disclosed 
herein (and additional data that can be generated using the methods described herein) inhibitors 
with different specificities can readily be designed. 

For example, according to the invention, the Ki ("potency") of an inhibitor of 

20 a PDZ-PL interaction can be determined. Ki is a measure of the concentration of an inhibitor 
required to have a biological effect. For example, administration of an inhibitor of a PDZ-PL 
interaction in an amoimt sufficient to result in an intracellular inhibitor concentration of at least 
between about 1 and about 100 Ki is expected to inhibit the biological response mediated by 
the target PDZ-PL interaction. In one aspect of the invention, the Kd measurement of PDZ-PL 

25 binding as determined using the methods supra is used in determining Ki. 

Thus, in one aspect, the invention provides a method of determining the potency 
(Ki) of an inhibitor or suspected inhibitor of binding between a PDZ domain and a ligand by 
immobilizing a polypeptide comprising the PDZ domain and anon-PDZ domain on a surface, 
contacting the immobilized polypeptide with a plurality of different mixtures of the ligand and 
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inhibitor, wherein the different mixtures comprise a fixed amount of ligand and different 
concentrations of the inhibitor, determining the amount of ligand bound at the different 
concentrations of inhibitor, and calculating the Ki of the binding based on the amount of ligand 
bound in the presence of different concentrations of the inhibitor. In an embodiment, the 
5 polypeptide is immobilized by bindingthe polypeptide to an immobilized immunoglobulinthat 
binds the non-PDZ domain. This method, which is based on the "G" assay described supra^ 
« is particularly suited for high-throughput analysis of the Ki for inhibitors of PDZ-ligand 
interactions. Further, using this method, the inhibition of the PDZ-ligand interaction itself is 
measured, without distortion of measurements by avidity effects. 

10 Typically, at least a portion of the ligand is detectably labeled to permit easy 

quantitation of ligand binding. 

It will be appreciated that the concentration of ligand and concentrations of 
inhibitor are selected to allow meaningful detection of inhibition. Thus, the concentration of 
* the ligand whose binding is to be blocked is close to or less than its binding affinity (e.g., 

15 preferably less than the 5x Kd of the interaction, more preferably less than 2x Kd, most 
preferably less than Ix Kd). Thus, the ligand is typically present at a concentrationof less than 
2 Kd (e.g., between about 0.0 1 Kd and about 2 Kd) and the concentrations of the test inhibitor 
typically range from 1 nM to 100 fiM (e.g. a 4-fold dilution series with highest concentration 
10 |LiM or 1 mM). In a preferred embodiment, the Kd is determined using the assay disclosed 

20 supra. 

The Ki of the binding can be calculated by any of a variety of methods routinely 
used in the art, based on the amount of ligand bound in the presence of different concentrations 
of the inhibitor, in an illustrative embodiment, for example, a plot of labeled ligand binding 
versus inhibitor concentration is fit to the equation: 

25 Sinhibitor = So*Ki/([I]+Ki) 

where Si„hibitoris the signal of labeled ligand binding to immobilized PDZ domain in the presence 
of inhibitor at concentration [I] and So is the signal in the absence of inhibitor (i.e., [I] = 0). 
Typically [I] is expressed as a molar concentration. 
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In another aspect of the invention, an enhancer (sometimes referred to as, 
augmentor or agonist) of binding between a PDZ domain and a ligand is identified by 
immobilizing a polypeptide comprising the PDZ domain and a non-PDZ domain on a surface, 
contacting the immobilized polypeptide with the ligand in the presence of a test agent and 
determining the amount of ligand bound, and comp2iring the amount of ligand bound in the 
presence of the test agoit with the amount of ligand bound by the polypeptide in the absence 
of the test agent. At least two-fold (often at least 5-fold) greater binding in the presence of the 
test agent compared to the absence of the test agent indicates that the test agent is an agent that 
enhances the binding of the PDZ domain to the ligand. As noted supra, agents that enhance 
PDZ-ligand interactions are useful for disruption (dysregulation) of biological events requiring 
normal PDZ-ligand function (e.g., cancer cell division and metastasis, and activation and 
migration of immune cells). 

The invention also provides methods for determining the "potency" or 
"Kenhancer" of an cnhanccr of a PDZ- ligand interaction. For example, according to the 
invention, the Kenhancer of an enhancer of a PDZ-PL interaction can be determined, e.g., using 
the Kd of PDZ-PL binding as determined using the methods described supra, Kenhancer is a 
measure of the concentration of an enhancer expected to have a biological effect. For example, 
administration of an enhancer of a PDZ-PL interaction in an amount sufficient to result in an 
intracellular inhibitor concentration of at least between about 0.1 and about 100 Kenhancer (^-g? 
between about 0,5 and about 50 Kenhancer) is expected to disrupt the biological response 
mediated by the target PDZ-PL interaction. 

Thus, in one aspect the invention provides a method of determining the potency 
(Kenhancer) of an cnhanccr or suspected enhancer of binding between a PDZ domain and a ligand 
by immobilizing a polypeptide comprising the PDZ domain and a non-PDZ domain on a 
surface, contacting the immobilized polypeptide with a plurality of different mixtures of the 
ligand and enhancer, wherein the different mixtures comprise a fixed amount of ligand, at least 
a portion of which is detectably labeled, and different concentrations of the enhancer, 
determining the amount of ligand bound at the different concentrations of enhancer, and 
calculating the potency (Kenhancer) of the enhancer from the bindmg based on the amount of 
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ligand bound in the presence of different concentrations of the enhancer. Typically, at least a 
portion of the ligand is detectably labeled to permit easy quantitation of ligand binding. This 
method, which is based on the "G" assay described supra, is particularly suited for high- 
throughput analysis of the Kenhancer for enhancers of PDZ-ligand interactions. 
5 It will be appreciated that the concentration of ligand and concentrations of 

enhancer are selected to allow meaningful detection of enhanced binding. Thus, the ligand is 
typically present at a concentration of between about 0.01 Kd and about 0.5 Kd and the 
concentrations of the test agent/enhancer typically range from 1 nM to 1 mM (e.g. a 4-fold 
dilution series with highest concentration 10 |iiM or 1 mM). In a preferred embodiment, the 
10 Kd is determined using the assay disclosed supra. 

The potency of the binding can be determined by a variety of standard methods 
based on the amount of ligand bound in the presence of different concentrations of the enhancer 
or augmentor. For example, a plot of labeled ligand binding versus enhancer concentration can 
be fit to the equation: 

15 S([E]) = S(0) + (S(0)*(Denhancerl)*[E]/([E]+ Ke„hancer) 

where "Kenhancer" IS the potency of the augmenting compound, and "Dgnhancer" is the fold- 
increase in binding of the labeled ligand obtained with addition of saturating amounts of the 
enhancing compound, [E] is the concentration of the enhancer. It will be understood that 
saturating amounts are the amount of enhancer such that further addition does not significantly 
20 increase the binding signal. Knowledge of "Kenhancer" is useful because it describes a 
concentration of the augmenting compound in a target cell that will result in a biological effect 
due to dysregulation of the PDZ-PL interaction. Typical therapeutic concentrations are 
between about 0.1 and about 100 Kgnhancer- 

25 . 

X. Global Analvsis of PDZ-PL Interactions 

As described supra^ the present invention provides powerful methods for 
analysis ofPDZ-ligand interactions, including high-throughputmethods such as the"G" assay 
and affinity assays described supra. In one embodiment of the invention, the affinity is 
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determined for a particular ligand and a plurality of PDZ proteins. Typically the plurality is 
at least 5, and often at least 25, or at least 40 different PDZ proteins. In a preferred 
embodiment, the plurality of different PDZ proteins are from a particular tissue (e.g., central 
nervous system, spleen, cardiac muscle, kidney) or a particular class or type of cell, (e.g., a 
hematopoietic cell, a lymphocyte, a neuron) and the like. In a most preferred embodiment, the 
plurality of different PDZ proteins represents a substantial fraction (e.g., typically a majority, 
more often at least 80%) of all of the PDZ proteins known to be, or suspected of being, 
expressed in the tissue or cell(s), e.g., all of the PDZ proteins known to be present in 
lymphocytes. In an embodiment, the plurality is at least 50%, usually at least 80%, at least 
90% or all of the PDZ proteins disclosed herein as being expressed in hematopoietic cells. 

In one embodiment of the invention, the binding of a ligand to the plurality of 
PDZ proteins is determined. Using this method, it is possible to identify a particular PDZ 
domain bound with particular specificity by the ligand. The binding may be designated as 
"specific" if the affinity of the ligand to the particular PDZ domain is at least 2-fold that of the 
binding to other PDZ domains in the plurality (e.g., present in that cell type). The binding is 
deemed "very specific" if the affinity is at least 10-fold higher than to any other PDZ in the 
plurality or, alternatively, at least 10-fold higher than to at least 90%, more often 95% of the 
other PDZs in a defined plurality. Similarly, the binding is deemed "exceedingly specific" if it 
is at least 100-fold higher. For example, a ligand could bind to 2 different PDZs with an affinity 
of 1 uM and to no other PDZs out of a set 40 with an affinity of less than 100 uM. This 
would constitute specific binding to those 2 PDZs. Similar measures of specificity are used to 
describe binding of a PDZ to a plurality of PLs. 

It will be recognized that high specificity PDZ-PL interactions represent 
potentially more valuable targets for achieving a desired biological effect. The ability of an 
inhibitor or enhancer to act with high specificity is often desirable. In particular, the most 
specific PDZ-ligand interactions are also the best therapeutic targets, allowing specific 
inhibition of the interaction. 

Thus, in one embodiment, the inventionprovides a method of identifying a high 
specificity interaction between a particular PDZ domain and a ligand known or suspected of 
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binding at least one PDZ domain, by providing a plurality of different immobilized 
polypeptides, each of said polypeptides comprising a PDZ domain and a non-PDZ domain; 
determining the affinity of the ligand for each of said polypeptides, and comparing the affinity 
of binding of the ligand to each of said polypeptides, wherein an interaction between the ligand 
5 and a particular PDZ domain is deemed to have high specificity when the ligand binds an 
immobilized polypeptide comprising the particular PDZ domain with at least 2-fold higher 
affinity than to immobilized polypeptides not comprising the particular PDZ domain. 

In a related aspect, the affinity of binding of a specific PDZ domain to a 
plurality of ligands (or suspected ligands) is determined. For exeimple, in one embodiment, the 

1 0 invention provides a method of identifying a high specificity interaction between a PDZ domain 
and a particular ligand known or suspected of binding at least one PDZ domain, by providing 
an immobilized polypeptide comprising the PDZ domain and a non-PDZ domain; determining 
the affinity of each of a plurality of ligands for the polypeptide, and comparing the affinity of 
binding of each of the ligands to the polypeptide, wherein an interaction between a particular 

15 ligand and the PDZ domain is deemed to have high specificity when the ligand binds an 
immobilized polypeptide comprising the PDZ domain with at least 2-fold higher affinity than 
other ligands tested. Thus, the binding may be designated as "specific" if the affinity of the 
PDZ to the particular PL is at least 2-fold that of the binding to other PLs in the plurality (e.g., 
present in that cell type). The binding is deemed "very specific" if the affinity is at least 10- 

20 fold higher than to any other PL in the plurality or, alternatively, at least 10-fold higher than 
to at least 90%, more often 95% of the other PLs in a defined plurality. Similarly, the binding 
is deemed "exceedingly specific" if it is at least 100-fold higher. Typically the plurality is at 
least 5 different ligands, more often at least 10. 

25 A. Use of Array for Global Predictions 

One discovery of the present inventors relates to the important and extensive 
roles played by interactions between PDZ proteins and PL proteins, particularly in the 
biological function of hematopoietic cells and other cells involved in the immune response. 
Further, it has been discovered that valuable information can be ascertained by analysis (e.g., 
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simultaneous analysis) of a large number of PDZ-PL interactions. In a preferred embodiment, 
the analysis encompasses all of the PDZ proteins expressed in a particular tissue (e.g., spleen) 
or type or class of cell (e.g., hematopoietic cell, neuron, lymphocyte, B cell, T cell and the like). 
Alternatively, the analysis encompasses at least about 5, or at least about 10, or at least about 
12, or at least about 15 and often at least 50 different polypeptides, up to about 60, about 80, 
about 100, about 150, about 200, or even more different polypeptides; or a substantial fraction 
(e.g., typically a majority, more often at least 80%) of all of the PDZ proteins known to be, or 
suspected of being, expressed in the tissue or cell(s), e.g., all of the PDZ proteins known to be 
present in lymphocytes. 

It will be recognized that the arrays and methods of the invention are directed 
to analyze of PDZ and PL interactions, and involve selection of such proteins for analysis. 
While the devices and methods of the invention may include or involve a small number of 
control polypeptides, they typically do not include significant numbers of proteins or fusion 
proteins that do not include either PDZ or PL domains (e.g., typically, at least about 90% of 
the arrayed or immobilized polypeptides in a method or device of the inventionis a PDZ or PL 
sequence protein, more often at least about 95%, or at least about 99%). 

It will be apparent from this disclosure that analysis of the relatively large 
number of different interactions preferably takes place simultaneously. In this context, 
"simultaneously" means that the analysis of several different PDZ-PL interactions (or the 
effect of a test agent on such interactions) is assessed at the same time. Typically the analysis 
is carried out in a higli throughput (e.g., robotic) fashion. One advantage of this method of 
simultaneous analysis is that it permits rigorous comparison of multiple different PDZ-PL 
interactions. For example, as explained in detail elsewhere herein, simultaneous analysis (and 
use of the arrays described infra) facilitates, for example, the direct comparison of the effect of 
an agent (e.g., an potential interaction inhibitor) on the interactions between a substantial 
portion of PDZs and/or PLs in a tissue or cell. 

Accordingly, in one aspect, the invention provides an array of immobilized 
polypeptide comprising the PDZ domain and a non-PDZ domain on a surface. Typically, the 
array comprises at least about 5, or at least about 10, or at least about 12, or at least about 15 
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and often at least 50 different polypeptides. In one preferred embodiment, the different PDZ 
proteins are from a particular tissue (e.g., central nervous system, spleen, cardiac muscle, 
kidney) or a particular cl£iss or type of cell, (e.g., a hematopoietic cell, a lymphocyte, a neuron) 
and the like. In a most preferred embodiment, the plurality of different PDZ proteins 
5 represents a substantial fraction (e.g.^ typically a majority, more often at least 60%, 70% or 
80%) of all of the PDZ proteins known to be, or suspected of being, expressed in the tissue or 
cell(s), e.g., all of the PDZ proteins known to be present in lymphocytes. 

Certain embodiments are arrays which include a plurality, usually at least 5,10, 
25, 50 PDZ proteins present in a particular cell of interest. In this context, "array" refers to 

10 an ordered series of immobilized polypeptides in which the identity of each polypeptide is 
associated with its location. In some embodiments the plurality of polypeptides are arrayed 
in a "common" area such that they can be simultaneously exposed to a solution (e.g., containing 
a ligand or test agent). For example, the plurality of polypeptides can be on a slide, plate or 
similar surface, which may be plastic, glass, metal, silica, beads or other surface to which 

15 proteins can be immobilized. In a different embodiment, the different immobilized 
polypeptides are situated in separate areas, such as different wells of multi-well plate (e.g., a 
24- well plate, a 96- well plate, a 384 well plate, and the like). It will be recognized that a similar 
advantage can be obtained by using multiple arrays in tandem. 

20 B. Analysis of PDZ-PL Inhibition Profile 

In one aspect, the invention provides a method for determining if a test 
compound inhibits any PDZ-ligand interaction in large set of PDZ-ligand interaction (e.g., a ^ 
plurality of the PDZ-ligands interactions described in Table 2; a majority of the PDZ-ligands 
identified in a particular cell or tissue as described supra (e.g., lymphocytes) and the like. In 

25 one embodiment, the PDZ domains of interest are expressed as GST-PDZ fusion proteins and 
immobilized as described herein. For each PDZ domain, a labeled ligand that binds to the 
domain with a known affinity is identified as described herein. 

For any known or suspected modulator (e.g., inhibitor) of a PDL-PL 
interaction(s), it is useful to know which interactions are inhibited (or augmented). For 
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example, an agent that inhibits all PDZ-PL interactions in a cell (e.g., a lymphocyte) will have 
different uses than an agent that inhibits only one, or a small number, of specific PDZ-PL 
interactions. The profile of PDZ interactions inhibited by a particular agent is referred to as 
the "inhibition profile" for the agent, and is described in detail below. The profile of PDZ 
5 interactions enhanced by a particular agent is referred to as the "enhancement profile" for the 
agent. It will be readily apparent to one of skill guided by the description of the inhibition 
profile how to determine the enhancement profile for an agent. The present invention provides 
methods for determining the PDZ interaction (inhibition/enhancement) profile of an agent in a 
single assay. 

10 In one aspect, the invention provides a method for determining the PDZ-PL 

inhibition profile of a compound by providing (i) a plurality of different immobilized 
polypeptides, each of said polypeptides comprising a PDZ domain and a non-PDZ domain and 
(ii) a plurality of corresponding ligands, wherein each ligand binds at least one PDZ domain in 
(i), then contacting each of said immobilized polypeptides in(i) with a corresponding ligand in 

15 (ii) in the presence and absence of a test compound, and determining for each polypeptide- 
ligand pair whether the test compound inhibits binding between the immobilized poly peptide 
and the corresponding ligand. 

Typically the plurality is at least 5, and ofteri at least 25, or at least 40 different 
PDZ proteins. In a preferred embodiment, the plurality of different ligands and the plurality 

20 of different PDZ proteins are from the same tissue or a particular class or type of cell, e.g., a 
hematopoietic cell, a lymphocyte, a neuron and the like. In a most preferred embodiment, the 
plurality of different PDZs represents a substantial fraction (e.g., at least 80%) of all of the 
PDZs known to be, or suspected of being, expressed in the tissue or cell(s), e.g., all of the 
PDZs known to be present in lymphocytes (for example, at least 80%, at least 90% or all of 

25 the PDZs disclosed herein as being expressed in hematopoietic cells). 

In one embodiment, the inhibition profile is determined as follows: A plurality 
(e.g., all known) PDZ domains expressed in a cell (e.g., lymphocytes) are expressed as GST- 
fusion proteins and immobilized without altering their ligand binding properties as described 
supra. For each PDZ domain, a labeled ligand that binds to this domain with a known affinity 

75 



is identified. If the set of PDZ domains expressed in lymphocytes is denoted by {Pl...Pn}, 
any given PDZ domain Pi binds a (labeled) ligand Li with affinity Kdi. To determine the 
inhibition profile for a test agent "compound X" the "G" assay {supra) can be performed as 
follows in 96- well plates with rows A-H and columns 1-12. Column 1 is coated with PI and 
5 washed. The corresponding ligand LI is added to each washed coated well of column 1 at a 
concentration 0.5 K^l with (rows B, D, F, H) or without (rows A, C, E, F) between about 1 
and about 1000 uM) of test compound X. Column 2 is coated with P2, and L2 (at a 
concentration 0.5 Kd2) is added with or without inhibitor X. Additional PDZ domains and 
ligands are similarly tested. 
1 0 Compound X is considered to inhibit the binding of Li to Pi if the average signal 

in the wells of column i containing X is less than half the signal in the equivalent wells of the 
column lacking X. Thus, in this single assay one determines the full set of lymphocyte PDZs 
that are inhibited by compound X. 

In some embodiments, the test compound X is a mixture of compounds, such 
15 as the product of a combinatorial chemistry synthesis as described supra. In some 
embodiments, the test compound is known to have a desired biological effect, and the assay is 
used to determine the mechanism of action (i.e., if the biological effect is due to modulating a 
PDZ-PL interaction). 

It will be apparent that an agent that modulates only one, or a few PDZ-PL 
20 interactions, in a panel (e.g., a panel of all known PDZs lymphocytes, a panel of at least 10, at 
least 20 or at least 50 PDZ domains) is a more specific modulator than an agent that modulate 
many or most interactions. Typically, an agent that modulates less than 20% of PDZ domains 
in a panel (e.g.. Table 2) is deemed a "specific" inhibitor, less than 6% a "very specific" 
inhibitor, and a single PDZ domain a "maximally specific" inhibitor. 
25 It will also be appreciated that "compound X" may be a composition containing 

mixture of compounds (e.g., generated using combinatorial chemistry methods) rather than a 
single compound. 

Several variations of this assay are contemplated: 

In some alternative embodiments, the assay above is performed using varying 
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concentrations of the test compound X, rather than fixed concentration. This allows 
determination of the Ki of the X for each PDZ as described above. 

In an ahernative embodiment, instead of pairing each PDZ Pi with a specific 
labeled ligand Li, a mixture of different labeled ligands is created that such that for every PDZ 
5 at least one of the ligands in the mixture binds to this PDZ sufficiently to detect the binding in 
the "G" assay. This mixture is then used for every PDZ domain. 

In one embodiment, compound X is known to have a desired biological effect, 
but the chemical mechanism by which it has that effect is unknown. The assays of the 
invention can then be used to determine if compound X has its effect by binding to a PDZ 
10 domain. 

In one embodiment, PDZ-domain containing proteins are classified in to groups 
based on their biological function, e.g. into those that regulate chemotaxis versus those that 
regulate transcription. An optimal inhibitor of a particular fimction (e.g., including but not 
limited to an anti-chemotactic agent, an anti-T cell activation agent, cell-cycle control, vesicle 

15 transport, apoptosis, etc.) will inhibit multiple PDZ-ligand interactions involved in the function 
(e.g., chemotaxis, activation) but few other interactions. Thus, the assay is used in one 
embodiment in screening and design of a drug that specifically blocks a particular function. For 
example, an agent designed to block chemotaxis might be identified because, at a given 
concentration, the agent inhibits 2 or more PDZs involved in chemotaxis but fewer than 3 other 

20 PDZs, or that inhibits PDZs involved in chemotaxis with a Ki > 10-fold better than for other 
PDZs. Thus, the invention provides a method for identifying an agent that inhibits a first 
selected PDZ-PL interaction or plurality of interactions but does not inhibit a second selected 
PDZ-PL interaction or plurality of interactions. The two (or more) sets of interactions can be 
selected on the basis of the known biological function of the PDZ proteins, the tissue 

25 specificity of the PDZ proteins, or any other criteria. Moreover, the assay can be used to 
determine effective doses (i.e,, drug concentrations)that result in desired biological effects while 
avoiding undesirable effects. 

C. Side Effects of PDZ-PL Modulator Interactions 
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In a related embodiment, the invention provides a method for determining likely 
side effects of a therapeutic that inhibits PDZ-ligand interactions. The method entails 
identifying those target tissues, organs or cell types that express PDZ proteins and ligands that 
are disrupted by a specified inhibitor. If, at a therapeutic dosage, a drug intended to have an 
effect in one organ system (e.g., hematopoietic system) disrupts PDZ-PL interactions in a 
different system (e.g., CNS) it can be predicted that the drug will have effects ("side effects") 
on the second system. It will be apparent that the information obtained from this assay will 
be useful in the rational design and selection of drugs that do not have the side-efifect. 

In one embodiment, for example, a comprehensive PDZ protein set is obtained. 
A "perfectly comprehensive" PDZ protein set is defined as the set of all PDZ proteins 
expressed in the subject animal (e.g., humans). A comprehensive set may be obtained by 
analysis of, for example, the human genome sequence. However, a "perfectly comprehensive" 
set is not required and any reasonably large set of PDZ domain proteins (e.g., the set of all 
known PDZ proteins; or the set listed in TABLE 6) will provide valuable information. 

In one embodiment, the method involves some of all of the following steps: 

a) For each PDZ protein, determine the tissues in which it is highly 
expressed. This can be done experimentally although the information generally will be available 
in the scientific literature; 

b) For each PDZ protein (or as many as possible), identify the cognate 
PL(s) bound by the PDZ protein; 

c) Determine the Ki at which the test agent inhibits each PDZ-PL 
interaction, using the methods described supra; 

d) From this information it is possible to calculate the pattern of PDZ-PL 
interactions disrupted at various concentrations of the test agent 

By correlating the set of PDZ-PL interactions disrupted with the expression pattern of the 
members of that set, it will be possible to identify the tissues likely affected by the agent. 

Additional steps can also be carried out, including determining whether a 
specified tissue or cell type is exposed to an agent following a particular route of 
administration. This can be determined using basis pharmacokinetic methods and principles. 
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D. Modulation of Activities 

The PDZ binding moieties and PDZ protein -PL protein binding antagonists of 
the invention are used to modulate biological activities or functions of cells (e.g., hematopoietic 
5 cells, such as T cells and B cells and the like), endothelial cells, and other immune system cells, 
as described herein, and for treatment of diseases and conditions in human and nonhuman 
animals (e.g., experimental models). Exemplary biological activities are listed supra. 

When administered to patients, the compounds of the invention (e.g., PL-PDZ 
interaction inhibitors) are useful for treating (ameliorating symptoms of) a variety of diseases 

10 and conditions, including diseases characterized by inflammatory and humoral immune 
responses, e.g., inflammation, allergy (e.g., systemic anaphylaxis, hypersensitivity responses, 
drug allergies, insect sting allergies; inflammatory bowel diseases, ulcerative colitis, ileitis and 
enteritis; psoriasis and inflammatory dermatoses^ scleroderma; respiratory allergic diseases such 
as asthma, allergic rhinitis, hypersensitivity lung diseases, and the like vasculitis, rh 

15 incompatibility, transfusion reactions, drug sensitivities, PIH, atopic dermatitis, eczema, 
rhinnitis; autoimmune diseases, such as arthritis (rheumatoid and psoriatic), multiple sclerosis, 
systemic lupus erythematosus, insulin-dependent diabetes, glomerulonephritis, scleroderma, 
MCTD, IDDM, Hashimoto thyroiditis, Goodpasture syndrome, psoriasis and the like, 
osteoarthritis, polyarthritis, graft rejection (e.g., allograft rejection, e.g., renal allograft rejection, 

20 graft- vs-host disease, transplantationrejection (cardiac, kidney, lung, liver, small bowel, comea, 
pancreas, cadaver, autologous, bone marrow, xenotransplantation)), atherosclerosis, 
angiogenesis-dependent disorders, cancers (e.g., melanomas and breast cancer, prostrate cancer, 
leukemias, lymphomas, metastatic disease), infectious diseases (e.g., viral infection, such as 
HIV, measles, parainfluenza, virus-mediated cell fusion,), ischemia (e.g., post-myocardial 

25 infarction complications, joint injury, kidney, scleroderma). 

E. Agonists and Antagonists of PDZ-PL Interactions 

As described herein, interactions between PDZ proteins and PL proteins in cells 
(e.g., hematopoietic cells, e.g., T cells and B cells) may be disrupted or inhibited by the 
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administration of inhibitors or antagonists. Inhibitors can be identified using screening assays 
described herein. In embodiment, the motifs disclosed herein are used to design inhibitors. In 
some embodiments, the antagonists of the invention have a structure (e.g., peptide sequence) 
based on the C-terminal residues of PL-domain proteins listed in TABLE 3. In some 
5 embodiments, the antagonists of the invention have a structure (e.g., peptide sequence) based 
on a PL motif disclosed herein. 

The PDZ/PL antagonists and antagonists of the invention may be any of a large 
variety of compounds, both naturally occurring and synthetic, organic and inorganic, and 
including polymers (e.g., oligopeptides, polypeptides, oligonucleotides, and polynucleotides), 

10 small molecules, antibodies, sugars, fatty acids, nucleotides and nucleotide analogs, analogs of 
naturally occurring structures (e.g., peptide niimetics, nucleic acid analogs, and the like), and 
numerous other compounds. Although, for convenience, the present discussion primarily refers 
antagonists of PDZ-PL interactions, it will be recognized that PDZ-PL interaction agonists can 
also be use in the methods disclosed herein. 

15 In one aspect, the peptides and peptide mimetics or analogues of the invention 

contain an amino acid sequence that binds a PDZ domain in a cell of interest. In one 
embodiment, the antagonists comprise a peptide that has a sequence corresponding to the 
carboxy-terminal sequence of a PL protein listed in TABLE 3, e.g., a peptide listed TABLE 3. 
Typically, the peptide comprises at least the C-terminal two (3), three (3) or four (4) residues 

20 of the PL protein, and often the inhibitory peptide comprises more than four residues (e.g., at 
least five, six, seven, eight, nine, ten, twelve or fifteen residues) from the PL protein C- 
terminus. 

In some embodiments, the inhibitor is a peptide, e.g., having a sequence of a PL 
C-terminal protein sequence. 
25 In some embodiments, the antagonist is a fusion protein comprising such a 

sequence. Fusion proteins containing a transmembrane transporter amino acid sequence are 
particularly useful. 

In some embodiments, the inhibitor is conserved variant of the PL C-terminal 
protein sequence having inhibitory activity. 
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In some embodiments, the antagonist is a peptide mimetic of a PL C-terminal 

sequence. 

In some embodiments, the inhibitor is a small molecule (i.e., having a molecular 
weight less than 1 kD). See, e.g. Section 6.5.4, infra. 

5 F. Peptide Antagonists 

In one embodiment, the antagonists comprise a peptide that has a sequence of 
a PL protein carboxy-terminus listed in TABLE 3. The peptide comprises at least the C- 
terminal two (2) residues of the PL protein, and typically, the inhibitory peptide comprises 
more than two residues (e.g. at least three, four, five, six, seven, eight, nine, ten, twelve or 
10 fifteen residues) from the PL protein C-terminus. The peptide may be any of a variety of 
lengths (e.g., at least 2, at least 3, at least 4, at least 5, at least 6, at least 8, at least 10, or at least 
20 residues) and may contain additional residuesnot from the PL protein. It will be recognized 
that short PL peptides are sometime used in the rational design of other small molecules with 
similar properties. 

1 5 Although most often, the residues shared by the inhibitory peptide with the PL 

protein are found at the C-terminus of the peptide. However, in some embodiments, the 
sequence is internal. Similarly, in some cases, the inhibitory peptide comprises residues from 
a PL sequence that is near, but not at tlie c-terminus of a PL protein (see. Gee et al., 1998, J 
Biological Chem. 273 :2 1 980-87). 

20 Sometime the PL protein carboxy-terminus sequence is referred to as the "core 

PDZ motif sequence" referring to the ability of the short sequence to interact with the PDZ 
domain. For example, in an embodiment, the "core PDZ motif sequence" contains the last four 
C-terminus amino acids. As described above, the four amino acid core of a PDZ motif sequence 
may contain additional amino acids at its amino terminus to further increase its binding affinity 

25 and/or stability. Thus, in one embodiment, the PDZ motif sequence peptide can be from four 
amino acids up to 15 amino acids. It is preferred that the length of the sequence to be 6-10 
amino acids. More preferably, the PDZ motif sequence contains 8 amino acids. Additional 
amino acids at the amino terminal end of the core sequence may be derived from the natural 
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sequence in each hematopoietic cell surface receptor or a synthetic linker. The additional amino 
acids may also be conservatively substituted. When the third residue from the C-terminus is 
S, T or Y, this residue may be phosphorylated prior to the use of the peptide. 

In some embodiments, the peptide and nonpeptide inhibitors of the are small, 
e.g., fewer than ten amino acid residues in length if a peptide. Further, it is reported that a 
limited number of ligand amino acids directly contact the PDZ domain (generally less than 
eight) (Kozlov etal., 2000, Biochemistry 39, 2572; Doyle et al., 1996, Cell 85, 1067) and that 
peptides as short as the C-terminal three amino acids often retain similar binding properties to 
longer (> 15) amino acids peptides (Yanagisawa et al., 1997, J. Biol. Chem. 272, 8539). 

G. Peptide Variants 

Having identified PDZ binding peptides and PDZ-PL interaction inhibitory 
sequences, variations of these sequences can be made and the resulting peptide variants can be 
tested for PDZ domain binding or PDZ-PL inhibitory activity. In embodiments, the variants 
have the same or a different ability to bind a PDZ domain as the parent peptide. Typically, 
such amino acid substitutions are conservative, i.e., the amino acid residues are replaced with 
other amino acid residues having physical and/or chemical properties similar to the residues 
they are replacing. Preferably, conservative amino acid substitutions are those wherein an 
amino acid is replaced with another amino acid encompassed within the same designated class. 

H. Peptide Mimetics 

Having identified PDZ binding peptides and PDZ-PL interaction inhibitory 
sequences, peptide mimetics can be prepared using routine methods, and the inhibitory activity 
of the mimetics can be confirmed using the assays of the invention. Thus, in some 
embodiments, the antagonist is a peptide mimetic of a PL C-terminal sequence. The skilled 
artisan will recognize that individual synthetic residues and polypeptides incorporating 
mimetics can be synthesized using a variety of procedures and methodologies, which are well 
described in the scientific and patent literature, e.g., Organic Syntheses Collective Volumes, 
Oilman et al. (Eds) John Wiley & Sons, Inc., NY. Polypeptides incorporating mimetics can 
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also be made using solid phase synthetic procedures, as described, e.g., by Di Marchi, et al., 
U.S. Pat. No. 5,422,426. Mimetics of the invention can also be synthesized using 
combinatorial methodologies. Various techniques for generationof peptide £ind peptidomimetic 
libraries are well known, and include, e.g., multipin, tea bag, and split-couple-mix techniques; 
5 see, e.g., al-Obeidi (1998) Mol. Biotechnol. 9:205-223; Hruby (1997) Curr. Opin. Chem. Biol. 
1:114-119; Ostergaard (1997) Mol. Divers. 3:17-27; Ostresh (1996) Methods Enzymol. 
267:220-234. 

1. Small Molecules 

In some embodiments, the inhibitor is a small molecule (i.e., having a molecular 
10 weight less than 1 kD). Methods for screening small molecules are well known in the art and 
include those described supra, 

XII. Preparation of Peptides 

A. Chemical Synthesis 

15 The peptides of the invention or analogues thereof, may be prepared using 

virtually any art-known technique for the preparation of peptides and peptide analogues. For 
example, the peptides may be prepared in linear form using coriventionalsolution or solid phase 
peptide syntheses and cleaved from the resin followed by purification procedures (Creighton, 
1983, Protein Structures And Molecular Principles, W.H. Freeman and Co., N.Y.). Suitable 

20 procedures for synthesizing the peptides described herein are well known in the art. The 
composition of the synthetic peptides may be confirmed by amino acid analysis or sequencing 
(e.g., the Edman degradation procedure and mass spectroscopy). 

In addition, analogues and derivatives of the peptides can be chemically 
synthesized. The linkage between each amino acid of the peptides of the invention may be an 

25 amide, a substituted amide or an isostere of amide. Nonclassical amino acids or chemical amino 
acid analogues can be introduced as a substitution or addition into the sequence. Non-classical 
amino acids include, but are not limited to, the D-isomers of the common amino acids, a-amino 
isobutyric acid, 4-aminobutyric acid, Abu, 2-amino butyric acid, yAbu, e-Ahx, 6-amino 
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hexanoic acid, Aib, 2-amino isobutyric acid, 3-amino propionic acid, ornithine, norleucine, 
norvaline, hydroxyproline, sarcosine, citrulline, cysteic acid, t-butylglycine, t-butylalanine, 
phenylglycine, cyclohexylalanine, p-alanine, fluoro-£imino acids, designer amino acids such as 
P-methyi amino acids, Ca-methyl amino acids, Na-methyl amino acids, and amino acid 
5 analogues in general. Furthermore, the amino acid can be D (dextrorotary) or L (levorotary), 

B. Recombinant Synthesis 

If the peptide is composed entirely of gene-encoded amino acids, or a portion 
of it is so composed, the peptide or the relevant portion may also be synthesized using 
conventional recombinant genetic engineering techniques. For recombinant production, a 

10 polynucleotide sequence encoding a linear form of the peptide is inserted into an appropriate 
expression vehicle, /. e. , a vector which contains the necessary elements for the transcription and 
translation of the inserted coding sequence, or in the case of an RNA viral vector, the necessary 
elements for replication and translation. The expression vehicle is then transfected into a 
suitable target cell which will express the peptide. Depending on the expression system used, 

15 the expressed peptide is then isolated by procedures well-established in the art. Methods for 
recombinant protein and peptide production are well known in the art {see^ e.g., Maniatis et ah , 
1989, Molecular Cloning A Laboratory Manual, Cold Spring Harbor Laboratory, N.Y.; and 
Ausubel et al,^ 1989, Current Protocols in Molecular Biology, Greene Publishing Associates 
and Wiley Interscience, N.Y.). 

20 A variety of host-expression vector systems may be utilized to express the 

peptides described herein. These include, but are not limited to, microorganisms such as 
bacteria transformed with recombinant bacteriophageDN A or plasmidDNA expression vectors 
containing an appropriate coding sequence; yeast or filamentous fungi transformed with 
recombinemtyeast or fungi expression vectors containing an appropriate coding sequence; insect 

25 cell systems infected with recombinant virus expression vectors (^.g., baculovirus) containing 
an appropriate coding sequence; plant cell systems infected with recombinant virus expression 
vectors (e.g., cauliflower mosaic virus or tobacco mosaic virus) or tremsformed with 
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recombinant plasmid expression vectors (e.g., Ti plasmid) containing an appropriate coding 
sequence; pr animal cell systems. 

The expression elements of the expression systems vary in their strength and 

specificities. Depending on the host/vector system utilized, any of a number of suitable 

-J 

transcription and translation elements, including constitutive and inducible promoters, may be 
used in the expression vector. For example, when cloning in bacterial systems, inducible 
promoters such as pL of bacteriophage X, plac, ptrp, ptac (ptrp-lac hybrid promoter) and the 
like may be used; when cloning in insect cell systems, promoters such as the baculovirus 
polyhedron promoter may be used; when cloning in plant cell systems, promoters derived from 
the genome of plant cells^e.g., heat shock promoters; the promoter for the sm£ill subunit of 
RUBISCO; the promoter for the chlorophyll a/b binding protein) or from plant viruses {e.g., 
the 35 S RNA promoter of CaMV; the coat protein promoter of TMV) may be used; when 
cloning in mammalian cell systems, promoters derived from the genome of mammalian cells 
(e.g., metallothioneinpromoter) or frommammalian viruses (e.g., the adenovirus late promoter; 
the vaccinia virus 7.5 K promoter) may be used; when generating cell lines that contain multiple 
copies of expression product, SV40-, BPV- and EBV-based vectors may be used with an 
appropriate selectable marker. 

In cases where plant expression vectors are used, the expression of sequences 
encoding the peptides of the invention may be driven by any of a number of promoters. For 
example, viral promoters such as the 35S RNA and 19S RNA promoters of CaMV (Brisson 
et al, 1984, Nature 310:51 1-514), or the coat protein promoter of TMV (Takamatsu et al.^ 
1987, EMBO J. 6:307-311) may be used; alternatively, plant promoters such as the small 
subunit of RUBISCO (Coruzzi etal, 1984,EMBO J. 3:1671-1680;Broglieer a/., 1984,Science 
224:838-843) or heat shock promoters, e.g., soybean hspl7.5-E or hspl7.3-B (Gurley etal, 
1986, Mol. Cell. Biol. 6:559-565) may be used. These constructs can be introduced into 
planleukocy tes using Ti plasmids, Ri plasmids, plant virus vectors, direct DN A transfomiation, 
microinjection, electroporation, etc. For reviews of such techniques 5ee, e.g., Weissbach & 
Weissbach, 1988, Methods for Plant Molecular Biology, Academic Press, NY, Section VIII, 
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pp. 421-463; and Grierson & Corey, 1988, Plant Molecular Biology , 2d Ed., Blackie, London, 
Ch. 7-9. 

In one insect expression system that may be used to produce the peptides of 
the invention, Autographa californica nnclear polyhidrosis virus (AcNPV) is used as a vector 
5 to express the foreign genes. The virus grows in Spodoptera frugiperda cells. A coding 
sequence may be cloned into non-essential regions (for example the polyhedron gene) of the 
virus and placed under control of an AcNPV promoter (for example, the polyhedron promoter). 
Successful insertion of a coding sequence will result in inactivationof the polyhedron gene and 
production of non-occluded recombinant virus (/.^., virus lacking theproteinaceous coat coded 

10 for by the polyhedron gene). These recombinant viruses are then used to infect Spodoptera 
frugiperda cells in which the inserted gene is expressed, (e.g., see Smith et al, 1983, J. Virol. 
46:584; Smith, U.S. Patent No. 4,215,051). Further examples of this expression system may 
be found in Current Protocols in Molecular Biology, Vol. 2, Ausubel et al.^ eds., Greene 
Publish. Assoc. & Wiley Interscience. 

15 In mammalian host cells, a number of viral based expression systems may be 

utilized. In cases where an adenovirus is used as an expression vector, a coding sequence may 
be ligated to an adenovirus transcription/translationcontrol complex, e.g. , the late promoter and 
tripartite leader sequence. This chimeric gene may then be inserted in the adenovirus genome 
by in vitro or in vivo recombination. Insertion in a non-essential region of the viral genome 

20 {e.g, , region El or E3) will result in a recombinant virus that is viable and capable of expressing 
peptide in infected hosts, {e.g.. See Logan & Shenk, 1984, Proc. Natl. Acad. Sci. USA 8 1 :3655- 
3659). Alternatively, the vaccinia?. 5 Kpromoter may be used, {see, e.g, Mackette^ al, 1982, 
Proc. Natl. Acad. Sci. USA 79:7415-7419; Mackett et al., 1984, J. Virol. 49:857-864; Panicali 
etal,, 1982, Proc. Natl. Acad. Sci. USA 79:4927-4931). 

25 Other expression systems for producing linear peptides of the invention will be 

apparent to those having skill in the art. 

Purification of the Peptides and Peptide Analogues 
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The peptides and peptide analogues of the invention can be purified by art- 
known techniques such as high performance liquid chromatography, ion exchange 
chromatography, gel electrophoresis, affinity chromatography and the hke. The actual 
conditions used to purify a particular peptide or analogue will depend, in part, on factors such 
5 as net charge, hydrophobicity, hydrophilicity, etc., and will be apparent to those having skill 
in the art. The purified peptides can be identified by assays based on their physical or 
functional properties, including radioactive labelmg followed by gel electrophoresis, 
radioimmuno-assays, ELISA, bioassays, and the like. 

For affinity chromatography purification, any antibody which specifically 

1 0 binds the peptides or peptide analogues may be used. For the production of antibodies, various 
host animals, including but not limited to rabbits, mice, rats, etc., may be immunized by 
injection with a peptide. The peptide may be attached to a suitable carrier, such as BSA or 
KLH, by means of a side chain functional group or linkers attached to a side chain functional 
group. Various adjuvants may be used to increase the immunological response, depending on 

1 5 the host species, including but not limited to Freund's (complete and incomplete), mineral gels 
such as aluminum hydroxide, surface active substances such as ly solecithin, pluronic polyols, 
poly anions, peptides, oil emulsions, keyhole limpet hemocyanin,dinitrophenol,and potentially 
useful human adjuvants such as BCG (bacilli Calmette-Guerin) and Corynebacteriumparvum. 

Monoclonal antibodiesto a peptide may be prepared using any technique which 

20 provides for the production of antibody molecules by continuous cell lines in culture. These 
include but are not limited to the hybridoma technique originally described by Koehler and 
Milstein, 1975, Nature 256:495-497, the human B-cell hybridoma technique, Kosbor et al, 
1983, Immunology Today 4:72; CoXq etal, 1983, Proc. Natl. Acad. Sci. U.S.A. 80:2026-2030 
and the EBV-hybridoma technique (Cole et al, 1985, Monoclonal Antibodies and Cancer 

25 Therapy,. Alan R. Liss, Inc., pp. 77-96 (1985)). In addition, techniques developed for the 
production of "chimeric antibodies" (Morrison et al, 1984, Proc. Natl. Acad. Sci. U.S.A. 
81:6851-6855; Neuberger et al, 1984, Nature 312:604-608; Takeda et al, 1985, Nature 
314:452-454) by splicing the genes from a mouse antibody molecule of appropriate antigen 
specificity together with genes from a human antibody molecule of appropriate biological 
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activity can be used. Alternatively, techniques described for the production of single chain 
antibodies (U.S. Patent No. 4,946,778) can be adapted to produce peptide-specific single chain 
antibodies. 

Antibody fragments which contain deletions of specific binding sites may be 
5 generated by known techniques. For example, such fragments include but are not limited to 
F(ab')2 fragments, which can be produced by pepsin digestion of the antibody molecule and Fab 
fragments, which can be generated by reducing the disulfide bridges of the F(ab')2 fragments. 
Alternatively, Fab expression libraries may be constructed (Huse et aL, 1989, Science 
246:1275-1281) to allow rapid and easy identification of monoclonal Fab fragments with the 
10 desired specificity for the peptide of interest. 

The antibody or antibody fragment specific for the desired peptide can be 
attached, for example, to agarose, and the antibody-agarose complex is used in 
immunochromatography to purify peptides of the invention. See, Scopes, 1984, Protein 
Purification: Principles and Practice, Springer- Verlag New York, Inc., NY, Livingstone, 1974, 
15 Methods Enzymology: Immunoaffinity Chromatography of Proteins 34:723-731. 

XIII. Uses of PDZ Domain Binding and Antagonist Compounds 

As indicated in the Background section, PDZ domain-containing proteins are 
involved in a number of biological functions, including, but not limited to, vesicular trafficking, 
tumor suppression, protein sorting, establishment of membrane polarity, apoptosis, regulation 

20 of immune response and organization of synapse formation. In general, this family of proteins 
has a common function of facilitating the assembly of multi-protein complexes, often serving 
as a bridge between several proteins, or regulating the function of other proteins. Additionally, 
as also noted supra, these proteins are found in essentially all cell types. 

Consequently, modulationof these interactions can be utilized to control a wide 

25 variety of biological conditions and physiological conditions. In particular, modulation of 
interactions such as those disclosed herein can be utilized to control movement of vesicles 
within a cell, inhibition of tumor formation, as well as in the treatment of immune disorders, 
neurological disorders, muscular disorders, and intestinal disorders. 
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Certain compounds which modulate binding of the PDZ proteins and PL 
proteins can be used to inhibit leukocyte activation, which is manifested in measurable events 
including but not limited to, cytokine production, cell adhesion, expansion of cell numbers, 
apoptosis and cytotoxicity. Thus, some compounds of the invention can be used to treat 
5 diverse conditions associated with undesirable leukocyte activation, including but not limited 
to, acute and chronic inflammation, graft-versus-host disease, transplantation rejection, 
hypersensitivities and autoimmunity such as multiple sclerosis, rheumatoid arthritis, peridontal 
disease, systemic lupus erythematosis, juvenile diabetes mellitis, non-insulin-dependent 
diabetes, and allergies, and other conditions listed herein (see, e.g.. Section 6.4, supra). 
10 Thus, the invention also relates to methods of using such compositions in 

modulating leukocyte activation as measured by, for example, cytotoxicity, cytokine 
production, cell proliferation, and apoptosis. 

XIV. Formulation and Route of Administration 

15 A. Introduction of Agonists or Antagonists (e.g.. Peptides and Fusion Proteins^ 

into Cells 

In one aspect, the PDZ-PL antagonists of the invention are introduced into a 
cell to modulate (i.e., increase or decrease) a biological function or activity of the cell. Many 
small organic molecules readily cross the cell membranes (or can be modified by one of skill 

20 using routine methods to increase the ability of compounds to enter cells, e.g., by reducing or 
eliminating charge, increasing lipophilicity , conjugating the molecule to a moiety targeting a cell 
surface receptor such that after interacting withithe receptor). Methods for introducing larger 
molecules, e.g., peptides and fusion proteins are also well known, including, e.g., injection, 
liposome-mediated fusion, application of a hydrogel, conjugation to a targeting moiety 

25 conjugate endocytozed by the cell, electroporation, and the like). 

In one embodiment, the antagonist or agent is a fusion polypeptide or 
derivatized polypeptide. A fusion or derivatized protein may include a targeting moiety that 
increases the ability of the polypeptide to traverse a cell membrane or causes the polypeptide 
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to be delivered to a specified cell type (e.g., liver cells or tumor cells) preferentially or cell 
compartment (e.g., nuclear compartment) preferentially . Examples of targeting moieties include 
lipid tails, amino acid sequences such as antennapoedia peptide or a nuclear localization signal 
(NLS; e.g., Xenopus nucleoplasmin Robbins et al., 1991, Cell 64:615). 

In one embodiment of the invention, a peptide sequence or peptide analog 
determined to inhibit a PDZ domain-PL protein binding, in an assay , of the invention is 
introduced into a cell by linking the sequence to an amino acid sequence that facilitates its 
transport through the plasma membrane (a "transmembrane transporter sequence"). The 
peptides of the invention may be used directly or fused to a transmembrane transporter 
sequence to facilitate their entry into cells. In the case of such a fusion peptide, each peptide 
may be fused with a heterologous peptide at its amino terminus directly or by using a flexible 
poly linker such as the pentamer G-G-G-G-S repeated 1 to 3 times. Such linker has been used 
in constructing single chain antibodies (scFv) by being inserted between Vh and (Bird et al. , 
1988, Science 242:423-426; Huston et al., 1988,Proc. Natl Acad Set U.S.A, 85:5979-5883). 
The linker is designed to enable the correct interaction between two beta-sheets forming the 
variable region of the single chain antibody. Other linkers which may be used include Glu-Gly- 
Lys-Ser-Ser-Gly-Ser-Gly-Ser-Glu-Ser-Lys-Val-Asp(Chaudhary et al, 1990, Proc. Natl Acad 
ScL U,S.A. 87:1066-1070) and Lys-Glu-Ser-Gly-Ser-Val-Ser-Ser-Glu-Gki-Leu-Ala-Gln-Phe- 
Arg-Ser-Leu-Asp (Bird et al., 1988, Science 242:423-426). 

A number of peptide sequences have been described in the art as capable of 
facilitating the entry of a peptide linked to these sequences into a cell through the plasma 
membrane (Derossi etaL, 1998, Trends in Cell Biol. 8:84). For the purpose of this invention, 
such peptides are collectively referred to as transmembrane transporter peptides. Examples 
of these peptide include, but are not limited to, tat derived from HIV (Vives et aL, 1 997, J. Biol 
Chem. 272:16010; Nagahara et al., 1998, Nat. Med. 4:1449), antennapedia from Drosophila 
(Derossi et al., 1994, J. Biol Chem. 261:10444), VP22 from herpes simplex virus (Elliot and 
D'Hare, 1997, Cell 88:223-233), complementarity-determining regions (CDR) 2 and 3 of anti- 
DNA antibodies (Avrameaset al., 1998, Proc. Natl Acad. ScL U.S.A., 95:5601-5606), 70 KDa 
heat shock protein (Fujihara, 1999, EMBOJ. 18:41 1-419) and transportan (Poogaet aL, 1998, 
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FASEB J. 12:67-77). In a preferred embodiment of the invention, a truncated HIV tat peptide 
having the sequence of G YGRKKRRQRRRG is used. 

It is preferred that a transmembrane transporter sequence is fused to a 
hematopoietic cell surface receptor carboxyl terrninal sequence at its amino-terminus with or 
without a linker. Generally, the C-terminus of a PDZ motif sequence (PL sequence) must be 
free in order to interact with a PDZ domain. The transmembrane transporter sequence may be 
used in whole or in part as long as it is capable of facilitating entry of the peptide into a cell. 

In an altemate embodiment of the invention, a hematopoietic cell surface 
receptor C-terminal sequence may be used alone when it is delivered in a manner that allows 
its entry into cells in the absence of a transmembrane transporter sequence. For example, the 
peptide may be delivered in a liposome formulation or using a gene therapy approach by 
delivering a coding sequence for the PDZ motif alone or as a fusion molecule into a target cell. 

The compounds of the of the invention may also be administered via liposomes, 
which serve to target the conjugates to a particular tissue, such as lymphoid tissue, or targeted 
selectively to infected cells, as well as increase the half-life of the peptide composition. 
Liposomes include emulsions, foams, micelles, insoluble monolayers, liquid crystals, 
phospholipid dispersions, lamellar layers and the like. In these preparations the peptide to be 
delivered is incorporated as part of a liposome, alone or in conjunction with a molecule which 
binds to, e.g., a receptor prevalent among lymphoid cells, such as monoclonal antibodies which 
bind to the CD45 antigen, or with other therapeutic or immunogenic compositions. Thus, 
liposomes filled with a desired peptide or conjugate of the invention can be directed to the site 
of lymphoid cells, where the liposomes then deliver the selected inhibitor compositions. 
Liposomes for use in the invention are formed from standard vesicle-forming lipids, which 
generally include neutral and negatively charged phospholipids and a sterol, such as cholesterol. 
The selection of lipids is generally guided by consideration of, e.g., liposome size, acid lability 
and stability of the liposomes m the blood stream. A variety of methods are available for 
preparing liposomes, as described in, e.g., Szoka et al., Ann. Rev. Biophys. Bioeng. 9:467 
(1980), U.S. Pat. Nos. 4,235,871, 4,501,728 and 4,837,028. 

The targeting of liposomes using a variety of targeting agents is well known in 
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the art (see, e.g., U.S. Patent Nos. 4,957,773 and 4,603,044). For targeting to the immune cells, 
a ligand to be incorporated into the liposome can include, e.g., antibodies or fragments thereof 
specific for cell surface determinants of the desired immune system cells. A liposome 
suspension containing a peptide or conjugate may be administered intravenously, locally, 
topically, etc. in a dose which varies according to, inter alia, the manner of administration, the 
conjugate being delivered, and the stage of the disease being treated. 

In order to specifically deliver a PDZ motif sequence (PL sequence) peptide 
into a specific cell type, the peptide may be linked to a cell-specific targeting moiety, which 
include but are not limited to, ligands for diverse leukocyte surface molecules such as growth 
factors, hormones and cytokines, as well as antibodies or antigen-binding fragments thereof. 
Since a large number of cell surface receptors have been identified in leukocytes, ligands or 
antibodies specific for these receptors may be used as cell-specific targeting moieties. For 
example, interleukin-2, B7-1 (CD80), B7-2 (CD86) and CD40 or peptide fragments thereof 
may be used to specifically target activated T cells (The Leucocyte Antigen Facts Book, 1997, 
Barclay et al. (eds.), Academic Press). CD28, CTLA-4 and CD40L or peptide fragments 
thereof may be used to specifically target B cells. Furthermore, Fc domains may be used to 
target certain Fc receptor-expressing cells such £is monocytes. 

Antibodies are the most versatile cell-specific targeting moieties because they 
can be generated against any cell surface antigen. Monoclonal antibodies have been generated 
against leukocyte lineage-specific markers such as certain CD antigens. Antibody variable 
region genes can be readily isolated from hybridoma cells by methods well known in the art. 
However, since antibodies are assembled between two heavy chains and two light chains, it is 
preferred that a scFv be used as a cell-specific targeting moiety in the present invention. Such 
scFv are comprised of Vh and Vl domains linked into a single polypeptide chain by a flexible 
linker peptide. 

The PDZ motif sequence (PL sequence) may be linked to a transmembrfme 
transporter sequence and a cell-specific targeting moiety to produce a tri-fiision molecule. This 
molecule can bind to a leukocyte surface molecule, passes through the membrane and targets 
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PDZ domains. Alternatively, a PDZ motif sequence (PL sequence) may be linked to a cell- 
specific targeting moiety that binds to a surface molecule that internalizes the fusion peptide. 

In an other approach, microspheres of artificial polymers of mixed amino acids 
(proteinoids) have been used to deliver pharmaceuticals. For example, U.S. Pat. No. 4,925,673 
describes drug-containing proteinoid microsphere carriers as well as methods for their 
preparation and use. These proteinoid microspheres are useful for the delivery of a number of 
active agents. Also see, U.S. Patent Nos. 5,907,030 and 6,033,884, which are incorporated 
herein by reference. 

B. Introduction of Polynucleotides into Cells 

By introducing gene sequences into cells, gene therapy can be used to treat 
conditions in which leukocytes are activated to result in deleterious consequences. In one 
embodiment, a polynucleotide that encodes a PL sequence peptide of the invention is 
introduced into a cell where it is expressed. The expressed peptide then inhibits the interaction 
of PDZ proteins and PL proteins in the cell. 

Thus, in one embodiment, the polypeptides of the invention are expressed in 
a cell by introducing a nucleic acid (e.g., a DNA expression vector or mRNA) encoding the 
desired protein or peptide into the cell. Expression may be either constitutive or inducible 
depending on the vector and choice of promoter. Methods for introduction and expression of 
nucleic acids into a cell are well known in the art and described herein. 

In a specific embodiment, nucleic acids comprising a sequence encoding a 
peptide disclosed herein, are administered to a human subject. In this embodiment of the 
invention, the nucleic acid produces its encoded product that mediates a therapeutic effect, 
Any of the methods for gene therapy available in the art can be used according to the present 
invention. Exemplary methods are described below. 

For general reviews of the methods of gene therapy, see Goldspiel et al., 1993, 
Clinical Pharmacy 12:488-505; Wu and Wu, 1991, Biotherapy 3:87-95;Tolstoshev, 1993, Ann. 
Rev. Pharmacol. Toxicol. 32:573-596; Mulligan, 1993, Science 260:926-932; and Morgan and 
Anderson, 1993, Ann. Rev. Biochem. 62:191-217; May, 1993, TIBTECH 1 1(5):155-215. 
Methods commonly known in the art of recombinant DNA technology which can be used are 
described in Ausubel et al. (eds.), 1993, Current Protocols in Molecular Biology, John Wiley 
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& Sons, NY; and Kriegler, 1990, Gene Transfer and Expression, A Laboratory Manual, 
Stockton Press, NY. 

In a preferred embodiment of the invention, the therapeutic composition 
comprises a coding sequence that is part of an expression vector. In particular, such a nucleic 
5 acid has a promoter operably linked to the coding sequence, said promoter being inducible or 
constitutive, and, optionally, tissue-specific. In another specific embodiment, a nucleic acid 
molecule is used in which the coding sequence and any other desired sequences are flanked by 
regions that promote homologous recombination at a desired site in the genome, thus providing 
for intrachromosomal expression of the nucleic acid (KoUer and Smithies, 1989, Proc. Natl. 

10 Acad. Sci. USA 86:8932-8935; Zijlstra et al., 1989, Nature 342:435-438). 

Delivery of the nucldc acid into a patient may be either direct, in which case the 
patient is directly exposed to the nucleic acid or nucleic acid-carrying vector, or indirect, in 
which case, cells are first transformed with the nucleic acid in vitro, then transplanted into the 
patient. These two approaches are known, respectively, as in vivo or ex vivo gene therapy. 

15 In a specific embodiment, the nucleic acid is directly administered in vivo, where 

it is expressed to produce the encoded product. This can be accomplished by any methods 
known in the art, e.g., by constructing it as part of an appropriate nucleic acid expression 
vector and administering it so that it becomes intracellular, e.g., by infection using a defective 
or attenuated retroviral or other viral vector (see U.S. Patent No. 4,980,286), by direct injection 

20 of naked DNA, by use of microparticle bombardment (e.g., a gene gun; Biolistic, Dupont), by 
coating with lipids or cell-surface receptors or transfecting agents, by encapsulation in 
liposomes, microparticles, or microcapsules, by administering it in linkage to a peptide which 
is known to enter the nucleus, or by administering it in linkage to a ligand subject to receptor- 
mediated endocytosis (see e.g., Wu and Wu, 1987, J. Biol. Chem. 262:4429-4432) which can 

25 be used to target cell types specifically expressing the receptors. In another embodiment, a 
nucleic acid-ligand complex can be formed in which the ligand comprises a fusogenic viral 
peptide to disrupt endosomes, allowdng the nucleic acid to avoid lysosomal degradation. In yet 
another embodiment, the nucleic acid can be targeted in vivo for cell specific uptake and 
expression, by targeting a specific receptor (see, e.g., PCT Publications WO 92/06180 dated 

30 April 16, 1992; WO 92/22635 dated December 23, 1992; WO92/20316 dated November 26, 
1992; W093/14188 dated July 22, 1993; WO 93/20221 dated October 14, 1993). 
Altematively, the nucleic acid can be introduced intracellularly and incorporated within host 
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cell DNAfor expression, by homologousrecombination(Kollerand Smithies, 1989, Proc.Natl. 
Acad. Sci. USA 86:8932-8935; Zijlstra et al., 1989, Nature 342:435-438). 

In a preferred embodiment of the invention, adenoviruses as viral vectors can be 
used in gene therapy. Adenoviruses have the advantage of being capable of infecting non- 
5 dividing cells (Kozarsky and Wilson, 1993, Current Opinion in Genetics and Development 
3:499-503). Other instances of the use of adenoviruses in gene therapy can be found in 
Rosenfeld et al., 1991, Science 252:431-434; Rosenfeld et al, 1992, Cell 68:143-155; and 
Mastrangeli etaL, 1993, J. Clin. Invest. 91:225-234. Furthermore, adenoviral vectors with 
modified tropism may be used for cell specific targeting (WO98/40508). Adeno-associated 

10 virus (AAV) has also been proposed for use in gene therapy (Walsh et al., 1993, Proc. Soc. 
Exp. Biol. Med. 204:289-300). 

In addition, retrpviralvectors (see Miller etal., 1993, Meth. Enzymol. 217:581- 
599) have been modified to delete retroviral sequences that are not necessiary for packaging of 
the viral genome arid integration into host cell DNA. The coding sequence to be used in gene 

15 therapy is cloned into the vector, which facilitates delivery of the gene into a patient. More 
detail about retroviral vectors can be found in Boesen et al., 1994, Biotherapy 6:291 -302, which 
describes the use of a retroviral vector to deliver the mdrl gene to hematopoietic stem cells in 
order to make the stem cells more resistant to chemotherapy. Other references illustrating the 
use of retroviral vectors in gene therapy are: Clowes et al., 1994, J. Clin. Invest. 93:644-651; 

20 Kiem et al., 1994, Blood 83:1467-1473; Salmons and Gunzberg, 1993, Human Gene Therapy 
4:129-141; and Grossman and Wilson, 1993, Curr. Opin. in Genetics and Devel. 3:110-1 14. 

Another approach to gene therapy involves transferring a gene to cells in tissue 
culture. Usually, the method of transfer includes the transfer of a selectable marker to the cells. 
The cells are then placed under selection to isolate those cells that have taken up and are 

25 expressing the transferred gene. Those cells are then delivered to a patient. 

In this embodiment, the nucleic acid is introduced into a cell prior to 
administration in vivo of the resulting recombinant cell. Such introduction can be carried out 
by any method known in the art, including but not limited to transfection, electroporation, 
lipofection, microinjection, infection with a viral or bacteriophage vector containing the nucleic 

30 acid sequences, cell fusion, chromosome-mediated gene transfer, microcell-mediated gene 
transfer, spheroplast fusion, etc. Numerous techniques are known in the art for the 
introduction of foreign genes into cells (see e.g., Loeflfler and Behr, 1993, Meth. Enzymol. 
217:599-618; Cohen et al., 1993, Meth. Enzymol. 217:618-644; Cline, 1985, Pharmac. Then 
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29:69-92) and may be used in accordance with the present invention, provided that the 
necessary developmental and physiological functions of the recipient cells are not disrupted. 
The technique should provide for the stable transfer of the nucleic acid to the cell, so that the 
nucleic acid is expressible by the cell and preferably heritable and expressible by its cell 
5 progeny. In a preferred embodiment, the cell used for gene therapy is autologous to the patient. 

In a specific embodiment, the nucleic acid to be introduced for purposes of gene 
therapy comprises an inducible promoter operably linked to the coding sequence, such that 
expression of the nucleic acid is controllable by controlling the presence or absence of the 
appropriate inducer of transcription. 

1 0 Oligonucleotides such as anti-sense RNA and DNA molecules, and ribozymes 

that function to inhibit the translation of a targeted mRNA, especially its C-terminus are also 
within the scope of the invention. Anti-sense RNA and DNA molecules act to directly block 
the translation of mRNA by binding to targeted mRNA and preventing protein translation. In 
regard to antisense DNA, oligodeoxyribonucleotidesderived from the translation initiation site, 

15 e.g., between -10 and +10 regions of a nucleotide sequence, are preferred. 

The antisense oligonucleotide may comprise at least one modified base moiety 
which is selected from the group including, but not limited to, 5-fluorouracil, 5-bromouracil, 
5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetyIcytosine, 
5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 

20 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopenteny ladenine, 1 -methylguanine, 1 -methy linosine, 2,2-dimethy Iguanine, 
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta- 
D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6- 

25 isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 

2- thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil- 
5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino- 

3 - N-2-carboxy propyl) uracil, (acp3)w, and 2,6-diaminopurine. 

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific 
30 cleavage of RNA. The mechanism of ribozyme action involves sequence specific hybridization 
of the ribozyme molecule to complemertary target RNA, followed by endonucleolyticcleavage. 
Within the scope of the invention are engineered hammerhead motif ribozyme molecules that 
specifically and efficiently catalyze endonucleolytic cleavage of target RNA sequences. 
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Specific ribozyme cleavage sites within any potential RNA target are initially 
identified by scanning the target molecule for ribozyme cleavage sites which include the 
following sequences, GUA, GUU and GUC. Once identified, short RNA sequences of between 
1 5 and 20 ribonucleotides corresponding to the region of the target gene containing the cleavage 
site may be evaluated for predicted structural features such as secondary structure that riiay 
render the oligonucleotide sequence unsuitable. The suitability of candidate targets may also 
be evaluated by testing their accessibility to hybridization with complementary 
oligonucleotides, using ribonuclease protection assays. 

The anti-sense RNA and DNA molecules and ribozymes of the invention may 
be prepared by any method known in the art for the synthesis of nucleic acid molecules. These 
include techniques for chemically synthesizing oligodeoxyrbonucleotideswell knovsoi in the art 
such as for example solid phase phosphoramidite chemical synthesis. Alternatively, RNA 
molecules may be generated by in vitro and in vivo transcription of DNA sequences encoding 
the RNA molecule. Such DNA sequences may be incorporated into a wide variety of vectors 
which contain suitable RNA polymerase promoters such as the T7 or SP6 polymerase 
promoters. Alternatively, antisense cDNA constructs that synthesize antisense RNA 
constitutively or inducibly, depending on the promoter used, can be introduced stably into cell 
lines. 

Various modifications to the DNA molecules may be introduced as a means of 
increasing intracellular stability and half-life. Possible modifications include, but are not limited 
to, the addition of flanking sequences of ribo- or deoxy- nucleotides to the 5* and/or 3' ends of 
the molecule or the use of phosphorothioate or T O-methyl rather than phosphodiesterase 
linkages within the oligodeoxyribonucleotide backbone. 

C. Other Pharmaceutical Compositions 

The compounds of the invention, may be administered to a subject per se or in 
the form of a sterile composition or a pharmaceutical composition. Pharmaceutical 
compositions comprising the compounds of the invention may be manufactured by means of 
conventional mixing, dissolving, granulating, dragee-making, levigating, emulsifying, 
encapsulating, entrapping or lyophilizing processes. Pharmaceutical compositions may be 
formulated in conventional manner using one or more physiologically acceptable carriers, 
diluents, excipients or auxiliaries that facilitate processing of the active peptides or peptide 
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analogues into preparations which can be used pharmaceutically. Proper formulation is 
dependent upon the route of administration chosen. * 

For topical administration the compounds of the invention may be formulated 
as solutions, gels, ointments, creams, suspensions, etc. as are well-known in the art. 
5 Systemic formulations include those designed for administration by injection, 

e.g. subcutaneous, intravenous, intramuscular, intrathecal or intraperitoneal injection, as well 
as those designed for transdermal, transmucosal, oral or pulmonary administration. 

For injection, the compounds of the invention may be formulated in aqueous 
solutions, preferably in physiologically compatible buffers such as Hanks*s solution. Ringer's 
1 0 solution, or physiological saline buffer. The solution may contain formulatory agents such as 
suspending, stabilizing and/or dispersing agents. 

Alternatively, the compounds may be in powder form for constitution with a 
suitable vehicle, e.g., sterile pyrogen-free water, before use. 

For transmucosal administration, penetrants appropriate to the barrier to be 
15 permeated are used in the formulation. Such penetrants are generally known in the art. This 
route of administration may be used to deliver the compounds to the nasal cavity. 

For oral administration, the compounds can be readily formulated by combining 
the active peptides or peptide analogues with pharmaceutically acceptable carriers well known 
in the art. Such carriers enable the compounds of the invention to be formulated as tablets, 
20 pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions and the like, for oral 
ingestion by a patient to be treated. For oral solid formulations such as, for example, powders, 
capsules and tablets, suitable excipients include fillers such as sugars, such as lactose, sucrose, 
meinnitol and sorbitol; cellulose preparations such as maize starch, wheat starch, rice starch, 
potato starch, gelatin, gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, 
25 sodium carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP); granulating agents; and 
binding agents. If desired, disintegrating agents may be added, such as the cross-linked 
polyvinylpyrrolidone, agar, or alginic acid or a salt thereof such as sodium alginate. 

If desired, solid dosage forms may be sugar-coated or enteric-coated using 
standard techniques. 

30 For oral liquid preparations such as, for example, suspensions, elixirs and 

solutions, suitable carriers, excipients or diluents include water, glycols, oils, alcohols, etc. 
Additionally, flavoring agents, preservatives, coloring agents and the like may be added. 
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For buccal administration, the compounds may take the form of tablets, 
lozenges, etc. formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the 
present invention are conveniently delivered in the form of an aerosol spray from pressurized 
5 packs or a nebulizer, with the use of a suitable propellant, e.g., dichlorodifluoromethane, 
trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In the 
case of a pressurized aerosol the dosage unit may be determined by providing a valve to deliver 
a metered amount. Capsules and cartridges of e.g. gelatin for use in an inhaler or insufflator may 
be formulated containing a powder mix of the compound and a suitable powder base such as 

10 lactose or starch. 

The compounds may also be formulated in rectal or vaginal compositions such 
as suppositories or retention enemas, e.g., containing conventional suppository bases such as 
cocoa butter or other glycerides. 

In addition to the formulations described previously, the compounds may also 

15 be formulated as a depot preparation. Such long acting formulations may be administered by 
implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. 
Thus, for example, the compounds may be formulated with suitable polymeric or hydrophobic 
materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as 
sparingly soluble derivatives, for example, as a sparingly soluble salt. 

2Q Alternatively, other pharmaceutical delivery systems may be employed. 

Liposomes and emulsions are well known examples of delivery vehicles that may be used to 
deliver peptides and peptide analogues of the invention. Certain organic solvents such as 
dimethylsulfoxide also may be employed, although usually at the cost of greater toxicity. 
Additionally, the compounds may be delivered using a sustained-release system, such as 

25 semipermeable matrices of solid polymers containing the therapeutic agent. Various of 
sustained-releasematerials have been established and are well known by those skilled in the art. 
Sustained-release capsules may, depending on their chemical nature, release the compounds for 
a few weeks up to over 1 00 days. Depending on the chemical nature and the biological stability 
of the therapeutic reagent, additional strategies for protein stabilization may be employed. 

30 As the compounds of the invention may contain charged side chains or termini, 

they may be included in any of the above-described formulations as the free acids or bases or 
as pharmaceutically acceptable salts. Pharmaceutically acceptable salts are those salts which 
substantially retain the biologic activity of the free bases and which are prepared by reaction 
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with inorganic acids. Pharmaceutical salts tend to be more soluble in aqueous and other protic 
solvents than are the corresponding free base forms. 

D. Effective Dosages 

The compounds of the invention will generally be used in an amount effective 
5 to achieve the intended purpose. The compounds of the invention or pharmaceutical 
compositions thereof, are administered or applied in a therapeutically effective amount. By 
therapeutically effective amount is meant an amount effective ameliorate or prevent the 
symptoms, or prolong the survival of, the patient being treated. Determination of a 
therapeutically effective amount is well within the capabilities of those skilled in the art, 
10 especially in light of the detailed disclosure provided herein. An "inhibitory amount" or 
"inhibitory concentration" of a PL-PDZ binding inhibitor is an amount that reduces binding by 
at least about 40%, preferably at least about 50%, often at least about 70%, and even as much 
as at least about 90%. Binding can as measured in vitro (e.g., in an A assay or G assay) or in 
situ. 

15 For systemic administration, a therapeutically effective dose can be estimated 

initially from in vitro assays. For example, a dose can be formulated in animal models to 
achieve a circulating concentration range that includes the IC50 as determined in cell culture. 
Such information can be used to more accurately determine useful doses in humans. 

Initial dosages can also be estimated from in vivo data, e.g., animal models, using 
20 techniques that are well known in the art. One having ordinary skill in the art could readily 
optimize administration to humans based on animal data. 

Dosage amount and interval may be adjusted individually to provide plasma 
levels of the compounds that are sufficient to maintain therapeutic effect. Usual patient 
dosages for administration by injection range from about 0.1 to 5 mg/kg/day, preferably from 
25 about 0.5 to 1 mg/kg/day. Therapeutically effective serum levels may be achieved by 
administering multiple doses each day. 

In cases of local administration or selective uptake, the effective local 
concentration of the compounds may not be related to plasma concentration. One having skill 
in the art will be able to optimize therapeutically effective local dosages without undue 
30 experimentation. 

The amount of compound administered will, of course, be dependent on the 
subject being treated, on the subject's weight, the severity of the affliction, the manner of 
administration and the judgment of the prescribing physician. 
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The therapy may be repeated intermittently while symptoms detectable or even 
when they are not detectable. The therapy may be provided alone or in combination with other 
drugs. In the case of conditions associated with leukocyte activation such as transplantation 
rejection and autoimmunity, the drugs that may be used in combination with the compounds 
5 of the invention include, but are not limited to, steroid and non-steroid anti-inflammatory 
agents. 

E. Toxicity 

Preferably, a therapeutically effective dose of the compounds described herein 

10 will provide therapeutic benefit without causing substantial toxicity. 

Toxicity of the compounds described herein can be detemiined by standard 
pharmaceutical procedures in cell cultures or experimental animals, e.g,, by determining the 
LD50 (the dose lethal to 50% of the population) or the LDioo (the dose lethal to 100% of the 
population). The dose ratio between toxic and therapeutic effect is the therapeutic index. 

1 5 Compounds which exhibit high therapeutic indices are preferred. The data obtained from these 
cell culture assays and animal studies can be used in formulating a dosage range that is not toxic 
for use in human. The dosage of the compounds described herein lies preferably within a range 
of circulating concentrations that include the effective dose with little or no toxicity. The 
dosage may vary within this range depending upon the dosage form employed and the route of 

20 administration utilized. The exact formulation, route of administration and dosage can be 
chosen by the individual physician in view of the patienfs condition. (See, e.g., Fingl et aL, 
1975, In: The Pharmacological Basis of Therapeutics, Ch.l, p.l). 

25 EXAMPLE 1 

GENERATION OF EUKARYOTIC EXPRESSION CONSTRUCTS BEARING DNA 
FRAGMENTS THAT ENCODE PDZ DOMAIN CONTAINING GENES OR 
PORTIONS OF PDZ DOMAIN GENES 

30 

This example describes the cloning of PDZ domain containing genes or portions 
of PDZ domain containing genes were into eukaryotic expression vectors in fusion with a 
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number of protein tags, including but not limited to Glutathione S-Transferase (GST), Enhanced 
Green Fluorescent Protein (EGFP), or Hemagglutinin (HA). 

A. Strategy 

5 DNA fragments corresponding to PDZ domain containing genes were generated 

by RT-PCR from RNA from a library of individual cell lines (CLONTECH Cat# K4000-1) 
derived RNA, using random (oligo-nucleotide) primers (Invitrogen Cat.# 4819001 1). DNA 
fragments corresponding to PDZ domain containing genes or portions of PDZ domain 
containing genes were generated by standard PGR, using above purified cDNA fragments and 

10 specific primers (see Table 5). Primers used were designed to create restriction nuclease 
recognition sites at the PGR fragment's ends, to allow cloning of those fragments into 
appropriate expression vectors. Subsequent to PGR, DNA samples were submitted to agarose 
gel electrophoresis. Bands corresponding to the expected size were excised. DNA was 
extracted by Sephaglas Band Prep Kit (Amersham Pharmacia Gat# 27-9285-01) and digested 

15 with appropriate restriction endonuclease. Digested DNA samples were purified once more 
by gel electrophoresis, according to the same protocol used above. Purified DNA fragments 
were coprecipitated and ligated with the appropriate linearized vector. After transformation 
into KcolU bacterial colonies were screened by colony PGR and restriction digest for the 
presence and correct orientation of insert. Positive clones were innoculatedin liquid culture for 

20 large scale DNA purification. The insert and flanking vector sites from the purified plasmid 
DNA were sequenced to ensure correct sequence of fragments and junctions between the 
vectors and fusion proteins. 

B. Vectors: 

25 All PDZ domain-containing genes were cloned into the vector pGEX-3X 

(Amersham Pharmacia #27-4803-01, Genemed Acc#U13852, GI#595717), containing a tac 
promoter, GST, Factor Xa, P-lactamase, and lac repressor. 

The ammo acid sequence of tlie pGEX-3X coding region including GST, Factor 
Xa, and the multiple cloning site is listed below. Note that linker sequences between the cloned 
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inserts and GST-Factor Xa vary depending on the restriction endonuclease used for cloning. 
Amino acids in the translated region below that may change depending on the insertion used 
are indicated in small caps, and are included as changed in the construct sequence listed in (C). 
aa 1 - aa 232: 

5 MSPILGYWKIKGLVQPTRLLLEYLEEKYEEHLYERDEGDKWRNKKFELGLEFP 
NLPYYIDGDVKLTQSMAIIRYIADKHNMLGGCPKERAEISMLEGAVLDIRYGV 
SRIAYSKDFETLKVDFLSKLPEMLKMFEDRLCHKTYLNGDHVTHPDFMLYDA 
LDVVLYMDPMCLDAFPKLVCFKKRIEAIPQIDKYLKSSKYIAWPLQGWQATF 
GGGDHPPKSDLIEGRgipgnss 

10 

In addition, TAX Interacting Protein 1 (TIP 1), in whole or part, was cloned into 
many other expression vectors, includingbut not limited to CD5y,PEAK10 (both provided by 
the laboratory of Dr. Brian Seed at Harvard University and generated by recombinant DN A 
technology, containing an IgG region), and MIN (a derivative of MSG V, containing IRES and 
15 NGFR, generated by recombinant DNA technology). 



C. Constructs: 

Primers used to generate DNA fragments by PGR are listed in Table 5. PGR 
primer combinations and restriction sites for insert and vector are listed below, along with 
20 amino acid translation for insert and restriction sites. Non-native amino acid sequences are 
shown in lower case. 



TABLE 5. Primers used in cloning of DLG 1 (domain 2 of 3), MAGI 1 (domain 2 of 6), and 



TIPl into representative expression vectors. 



1D# (Primer Name) 


Primer Sequence 


Description 


1928 (654DL1 2F) 


AATGGGGATCCAGCT 
CATTAAAGG 


Forward (5' to 3') primer corresponding to 
DLG 1, domain 2 of 3. Generates a Bam HI 
site upstream (5') of the PDZ boundary. Used 
for cloning into pGEX-3X. 


1929 (655DL1 2R) 


ATACATACTTGTGGA 
ATTCGCCAC 


Reverse (3' to 5') primer corresponding to 
DLG 1, domain 2 of 3. Generates an EcoRl 
site downstream (3') of the PDZ boundary. 
Used for cloning into pGEX-3X. 


1453 (435BAF) 


CACGGATCCCTTCTG 
AGTTGAAAGGC 


Forward (5' to 3') primer corresponding to 
MAGI 1, domain 2 of 6. Generates a BamHl 
site upstream (5') of the PDZ boundary. Used 
for cloning into pGEX-3X. 



103 



1454 (436BAR) 


TATGAATTCCATCTG 
GATCAAAAGGCAATG 


Reverse (3' to 5') primer corresponding to 
MAGI 1, domain 2 of 6. Generates an EcoRl 
site downstream (3') of the PDZ boundary. 
Used for cloning into pGEX-3X. 


399 (86TAF) 


CAGGGATCCAAAGAG 
TTGAAATTCACAAGC 


Forward (5' to 3') primer corresponding to 
TIPl. Generates a Bam HI site upstream (5') 
of the PDZ boundary. Used for cloning into 
PGEX-3X. 


400 (87TAR) 


ACGGAATTCTGCAGC 
GACTGCCGCGTC 


Reverse (3' to 5') primer corresponding to 
TlPl. Generates an EcoRl site downstream 
(3') of the PDZ boundary. Used for cloning 
into PGEX-3X. 


1319 (TIP G5-1) 


AGGATCCAGATGTCC 
TACATCCC 


Forward (5' to 3') primer corresponding to 
TIPl. Generates a Bam HI site upstream (5') 
of ' the start codon. Used for cloning into 
PGEX-3X. 


1320 (TIP G3-1) 


GGAATTCATGGACTG 
CTGCACGG 


Reverse (3' to 5') primer corresponding to 
TIPl. Generates an EcoRl site downstream 
(3') of the stop codon. Used for cloning into 
PGEX-3X. 


2753 (1109TIF) 


AGAGAATTCTCGAGA 
TGTCCTACATCCC 


Forward (5' to 3') primer corresponding to 
TIPl. Generates an EcoRl site upstream (5') 
of the start codon. Used for cloning into MIN. 


2762 (1117TIR) 


TGGGAATTCCTAGGA 
CAGCATGGACTG 


Reverse (3' to 5') primer corresponding to 
TIPl. Generates an EcoRl site downstream 
(3') of the stop codon. Used for cloning into 
MIN. 


2584 (I080TIF) 


CTAGGATCCGGGCCA 
GCCGGTCACC 


Forward (5' to 3'). primer corresponding to 
TIPl. Generates a Bam HI site upstream (5') 
of the PDZ boundary. Used for cloning into 
PEAK 10 or CDSy. 


2585 (1081T1R) 


GACGGATCCCCCTGC 
TGCACGGCCTTCTG 


Reverse (3' to 5') primer corresponding to 
TIPl. Generates a Bam HI site downstream 
(3') of the PDZ boundary. Used for cloning 
into PEAK 10 or CDSy. 


2586 (1082TIR) 


GACGAATTCCCCTGC 
TGCACGGCCTTCTG 


Reverse (3' to 5') primer corresponding to 
TIPl. Generates an EcoRl site downstream 
(3') of the PDZ boundary. Used for cloning 
into PEAK 10 or CDSy. 


2587 (1083TIF) 


CTAGAATTCGGGCCA 
GCCGGTCACC 


Forward (5' to 3') primer corresponding to 
TIPl . Generates an Eco Rl site upstream (5 ') 
of the PDZ boundary. Used for cloning into 
PEAK 10 or CDSy. 



1. DLG 1. PDZ domain 2 of 3: 
Acc#:U13897 
GI#:558437 

•Construct: DLG 1, PDZ domain 2 of 3-pGEX-3X 
Primers: 1928 & 1929 
Vector Cloning Sites(5'/3'): Bam Hl/EcoRl 
Insert Cloning Sites(5V3'): BamHl/EcoRl 
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aal-aa88 

giqLIKGPKGLGFSIAGGVGNQHIPGDNSIYVTKIIEGGAAHKDGKLQIG 
DKLLAVNNVCLEEVTHEEAVTALKNTSDFVYLKVAnss 

2. MAGI 1. PDZ domain 2 of 6: 

Acc#:ABO 10894 
GI#:3370997 

•Construct: MAGI 1, PDZ domain 2 of 6-pGEX-3X 

Primers: 1453 & 1454 

Vector Cloning Sites(5'/3'): Bam Hl/EcoRl 

Insert Cloning Sites(5V3'): BamHl/EcoRl 

aal-aal08 

giPSELKGKFIHTKLRKSSRGFGFTVVGGDEPDEFLQIKSLVLD 

GPAALDGKMETGDVIVSVNDTCVLGHTHAQVVKIFQSIPIG 

ASVDLELCRGYPLPFDPDgihrd 

3. TAX Interacting Protein 1 (TIPH: 

Acc#:AF028823.2 
GI#:1 1908159 
•Construct: TIPl, PDZ domain 1 of l-pGEX-3X 
Primers: 399& 400 

Vector Cloning Sites(5V3'): Bam Hl/EcoRl 
Insert Cloning Sites(5 73 '): BamH 1/EcoRl 

aal-aal07 

giQRVEIHKLRQGENLILGFSIGGGIDQDPSQNPFSEDKTDKGl 
YVTRVSEGGPAEIAGLQIGDKIMQVNGWDMTMVTHDQAR 
KRLTKRSEEVVRLLVTRQSLQnss 

•Construct: TIPl-pGEX-3X 

Primers: 1319& 1320 

Vector Cloning Sites(5'/3'): Bam Hl/EcoRl 
Insert Cloning Sites(573'): BamHl/EcoRl 

aal-aal28 

giqMSYIPGQPVTAVVQRVEIHKLRQGENLILGFSIGGGIDQDPSQNPF 

SEDKTDKGIYVTRVSEGGPAEIAGLQIGDKIMQVNGWDMTMVTHD 

QARKRLTKRSEEVVRLLVTRQSLQKAVQQSMnss 

•Construct: TIPl-MIN 

Primers: 2753& 2762 

Vector Cloning Sites(573'): EcoRl/EcoRl 

Insert Cloning Sites(573'): EcoRl/EcoRl 
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aa 1-aa 129 



5 



agilEMSYIPGQPVTAVVQRVEIHKLRQGENLILGFSIGGGIDQ 
DPSQNPFSEDKTDKGIYVTRVSEGGPAEIAGLQIGDKIMQVN 
GWDMTMVTHDQARKRLTKRSEEVVRLLVTRQSLQKAVQQ 
SMLS 
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•Construct: TIPl-CDSy 

Primers: 2584& 2585 

Vector Cloning Sites(5 73'): Bam HI/ Bam HI 
Insert Cloning Sites(573'): BamHl/ Bam HI 



15 



aa 1-aa 122 

adPGQPVTAVVQRVEIHKLRQGENLILGFSIGGGIDQDPSQNP 

FSEDKTDKGIYVTRVSEGGPAEIAGLQIGDKIMQVNGWDM 

TMVTHDQARKRLTKRSEEVVRLLVTRQSLQKAVQQSdpe 



D. GST Fusion Protein Production and Purification 

The constructs using pGEX-3X expression vector were used to make fusion 
20 proteins according to the protocol outlined in the GST Fusion System, Second Edition, 
Revision 2, Pharmacia Biotech. Method II and was optimized for a IL LgPP. 

Purified DNA was transformed into E.coli and allowed to grow to an OD of 
0.4-0.8 (600A,). Protein expression was induced for 1-2 hours by addition of IPTG to cell 
culture. Cells were harvested and lysed. Lysate was collected and GS4B beads (Pharmacia 
25 Cat# 17-0756-01) were added to bind GST fusion proteins. Beads were isolated and GST 
fusion proteins were eluted with GEB II. Purified proteins were stored in GEB II at -SO^'C. 
Purified proteins were used for ELISA-based assays and antibody 

production. 

30 E. IgG Fusion Protein Production and Purification 



used to make fusion protein. Purified DNA vectors were transfected into 293 EBNA T cells 
under standard growth conditions (DMEM +10% FCS) using standard calcium phosphate 
precipitation methods (Sambrook, Fritsch and Maniatis, Cold Spring Harbor Press) at a ratio 
35 of --1 ug vector DNA for 1 million cells. This vector results in a fusion protein that is secreted 



The constructs using the CD5gamma or PeaklOIgG expression vectors were 
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into the growth medium. Transiently transfected cells are tested for peak expression, and 
growth media containing fusion protein is collected at that maxima (usually 1 -2 days). Fusion 
proteins are either purified using Protein A chromatography or frozen directly in the growth 
media without addition. 



10 EXAMPLE! 

IDENTIFICATION OF INTERLEUKIN 8 RECEPTOR A (ILSRA) INTERACTIONS 
WITH MAGIl (domain 2\ TIPl (domain n and MINT2 fdomains 1 & 2HN VITRO 

This example describes the binding of IL8RA to MAGIl (domain 2 of 6), TIPl, and 
Mint2 (domains 1 & 2), assessed using a modified ELIS A. Briefly, a GST-PDZ fiasion was 
produced that contained the entire PDZ domain of human MAGIl or TIPl (see Example 2), 
In the case of Mint2, domains 1 and 2, the GST-PDZ fusion contained the entire PDZ domain 
for both domains 1 £ind 2. In addition, biotinylated peptide corresponding to the C-terminal 
20 amino acids of IL8RA was synthesized and purified by HPLC. Binding between these 
entities was detected through the "G" Assay^ a colorimetric assay using avidin-HRP to bind 
the biotin and a peroxidase substrate. 

A. Peptide Purification 
25 Peptide representing the C-terminal 20 amino acids of IL8RA was synthesized by 

standard FMOC chemistry and biotinylated if not used as an unlabeled competitor. Peptide 
was purified by reverse phase high performance liquid chromatography (HPLC) using a Vydac 
218TP CI 8 Reversed Phase column having the dimensions of 10*25 mm, 5 um. 
Approximately 40 mg of peptide was dissolved in 2.0 ml of aqueous solution of 49.9% 
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,15 



20 



acetonitrile and 0.1% Tri-Fluoro acetic acid (TFA). This solution was then injected into the 
HPLC machine through a 25 micron syringe filter (Millipore), Buffers used to get a good 
separation are (A) distilled water with 0.1% TFA and (B) 0.1% TFA with Acetonitrile. 
Gradient Segment setup is listed in Table 7. 



TABLE 7. 



Time 


A 


B 


C 


Flow rate (ml/min) 


0 


96% 


4% 


0 


5.00 


30 


100% 


100% 


0 


5.00 


35 


100% 


100% 


0 


5.00 


40 


96% 


4% 


0 


5.00 



The separation occurs based on the nature of the peptides. A peptide of overall hydrophobic 
nature will elute off later than a peptide of a hydrophilic nature. Fractions containing the 
"pure" peptide were collected and checked by Mass Spectrometer (MS). Purified peptides are 
lyophilized for stability and later use. 

B. "G" Assay for Identification of Interactions Between Peptides and Fusion Proteins 



Reagents and Materials 

• Nunc Polysorp 96 well Immuno-plate (Nunc cat#62409-005) 

(Maxisorp plates have been shown to have higher background signal) 

• PBS pH 7.4 (Gibco BRL cat# 16777- 148) or 

AVC phosphate buffered saline, 8gm NaCl, 0.29 gm KCl, 1.44 gm Na2HP04, 
0.24gm KH2PO4, add H20 to 1 L and pH 7.4; 0.2 _ filter 

• 2% BSA/PBS (lOgm of bovine serum albumin, fraction V (ICN Biomedicals 

cat#IC15142983) into 500 ml PBS 

• Goat anti-GST mAb stock @ 5 mg/ml, store at 4^C, (Amersham Pharmacia 

cat#27-4577-01), dilute 1:1000 in PBS, final concentration 5 _g/ml 

• HRP-Streptavidin, 2.5mg/2ml stock stored at 4''C (Zymed cat#43-4323), 

dilute 1:2000 into 2% BSA, final concentration at 0.5 _g/ml 

• Wash Buffer, 0.2% Tween 20 in 50mM Tris pH 8.0 

• TMB ready to use (Dako cat#S1600) 

• IM H2S04 

• 12w multichannel pipettor, 

• 50 ml reagent reservoirs, 

• 15 ml polypropylene conical tubes 
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Protocol 



1) Coat plate with 100 ul of 5 ug/ml goat anti GST, O/N @ 4°C 

2) Dump coating antibodies out and tap dry 

3) Blocking - Add 200 ul per well 2% BSA, 2 hrs at 4^C 
5 4) Prepare proteins in 2% BSA 

(2ml per row or per two columns) 

5) 3 washes with cold PBS (must be cold through entire experiment) 
(at last wash leave PBS in wells until immediately adding next step) 

6) Add proteins at 50ul per well on ice (1 to 2 hrs at 4^C) 
10 7) Prepare Peptides in 2% BSA (2 ml/row or /columns) 

8) 3 X wash with cold PBS 

9) Add peptides at 50 ul per well on ice ( time on / time off ) 

keep on ice after last peptide has been added for 10 minutes exactly 
place at room temp for 20 minutes exactly 
15 10) Prepare 12 ml/plate of HRP-Streptavidin (1 :2000 dilution in 2%BSA) 

11) 3 X wash with cold PBS 

1 2) Add HRP-Streptavidin at 1 00 ul per well on ice, 20 minutes at 4''C 

13) Turn on plate reader and prepare files 

14) 5 X washes, avoid bubbles 

20 15) Using gloves, add TMB substrate at 100 ul per well 

incubate in dark at room temp 
check plate periodically (5, 10, & 20 minutes) 
take early readings, if necessary, at 650 nm (blue) 
- at 20 minutes, stop reaction with 100 ul of IM H2S04 

25 - take last reading at 450mn (yellow) 



C. Results of Binding Experiments 

30 Results of peptides representing the carboxy-terminal20 amino acids of IL8RA binding 

to MAGI 1 , domain 2 of 6, TIPl , and Mint2, domains 1 and 2, are shown in Figure 1 . Clearly, 
IL8RA binds GST-MAGIl domain 2 and GST-TIP 1 with much higher affinity than it does to 
GST-Mint2 domains 1 & 2 at equivalent peptide concentrations and with equivalent amount 
of GST-PDZ fusion protein. Because the interaction between I18RA and Mint2 is not 

35 significantly higher than background, Mint2 PDZ's may not interact with IL8RA PL peptide 
when tested in this assay. 
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D. Conclusions and Summary 

MAGIl (domain 2) and TIPl bind to IL8RA better than Mint2 (domains 1 and 2) bind 
to the same peptide. 

5 The "G" Assay provides an accurate method for testing the binding of PDZ proteins 

to PL peptides in vitro, and highlights the specificity of PDZ-PL pairing. The same peptide 
can interact more or less strongly with different PDZs, and binding strength is not relative for 
the same PDZ. However, binding affinity and binding patterns of PDZ's and PL's are not 
predictable, and binding profiles may change with assay variations and data interpretation. 
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EXAMPLES 



IDENTIFICATION OF ALPHA ADRENERGIC RECEPTOR INTERACTIONS WITH 
15 PDZ PROTEINS 



This exeimple describes the binding of a subset of alpha adrenergic receptors and PDZ 
domains using the modified ELISA described in the previous example. Biotinylated peptides ^ 
corresponding to the C-terminal 20 amino acids of AlA, AlB and AlC synthesized and 
20 purified by HPLC. Binding between these entities was detected through the "G" Assay, a 
colorimetric assay using avidin-HRP to bind the biotin and a peroxidase substrate. 



Table 8A: 



AVC NAME 


SEQUENCE 


GENE NAME 


DOMAIN 


AVERAGE OD 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


AF6 


1 


0.3325 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


AF6 


1 


0.5125 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


AlP.C 


1 


2.594 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


AlPG 


1 


1.93 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


AlPC 


3 


0.1465 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


AlPC 


3 


0.2165 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


AlPC 


4 


0.197 


ALPHA.2A AR 


HDFRRAFKKILARGDRKRIV 


AlPC 


, 4 


0.2285 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


APXL1 


1 


1.194 
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AVC NAME 


SEQUENCE 


GENE NAME 


DOMAIN 


AVERAGE OD 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


APXL1 




0.5545 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


CARD14 


— \ — 


1.06 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


CARD14 




0.6535 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


CASK 


1 


0.2825 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


CASK 


1 


0.475 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


CNK1 


1 


0.275 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


CNK1 


1 


0.4505 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


CYTOHESIN BINDING 
PROTEIN 


1 


0.2515 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


CYTOHESIN BINDING 
PROTEIN 


1 


0.38 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


DLG1 


1 


0.142 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


DLG1 




0.204 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


DLG1 


2 


0.1665 


ALPHA-2A AR . 


HDFRRAFKKILARGDRKRIV 


DLG1 


2 


0.272 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


DLG1 


3 


0.2415 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


DLG1 


3 


0.6315 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


DLG1 


1.2 


0.2435 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


DLG1 


1,2 


0.3955 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


DLG2 


1 


0.1185 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


DLG2 


1 


0.2255 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


DLG2 


2 


0.208 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


DLG2 


2 


0.3005 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


DLG5 


1 


0.1955 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


DLG5 


1 


0.168 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


DLG5 


2 


0.3655 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


DLG5 


2 


0.6325 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


DLG5 


2 


0.648 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


DLG5 


2 


0.474 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


DVL2 




0.294 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


DVL2 




0.4565 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


DVL3 




0.4915 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


DVL3 




0.8465 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


EBP50 




0.406 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


EBP50 




0.1385 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


EBP50 


2 


0.2395 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


EBP50 


2 


0.139 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


EBP50 


1.2 


0.2515 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


EBP50 


1.2 


0.1295 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


ENIGMA 




0.3955 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


ENIGMA 




0.144 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


ERBIN 




0.2285 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


ERBIN 




0.451 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


FLJ0001 1 




0.2725 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


FLJ0001 1 




0.402 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


FLJ11215 




0.141 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


FLJ11215 




0.2065 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


FLJ12615 




0.157 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


FLJ12615 




0.26 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


FLJ21687 




0.9965 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


FLJ21687 




0.8225 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


GRIP 1 


4 


0.402 


Al PMA 9A AR 


WnFRR AFk^l^ll ARriHRKRlV/ 
nurr\r\r\rr\r\i UMr\OL/r\f\r\i v 


Orvln 1 


A 
*r 




ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


GRIP 1 


5 


0.405 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


GRIP 1 


5 


0.3185 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


GRIP 1 


6 


0.3795 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


GRIP 1 


6 


0.177 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


GRIP 1 


7 


0.26 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


GRIP 1 


7 


0.187 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


HEMBA 1003117 


1 


0.558 
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AVC NAME 


SEQUENCE 


GENE NAME 


DOMAIN 


AVERAGE OD 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


HEMBA 1003117 




0.41^ 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


HEMBA 1003117 




0.6875 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


HEMBA 1003117 




0.8515 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


INADL 




0.336 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


INADL 




0.5975 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


INADL 


3 


1.095 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


INADL 


3 


2.1295 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


INADL 


4 


0.6395 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


INADL 


4 


1.049 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


INADL 


5 


0.2175 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


INADL 


5 


0.3455 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


INADL 


7 


0.372 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


INADL 


7 


0.5995 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


INADL 


8 


0.2785 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


INADL 


8 


0.47 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA0316 




0.1965 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA0316 




0.18 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA0340 




0.855 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA0340 




1.224 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA0380 




2.061 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA0380 




2.5805 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA0382 




0.2085 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA0382 




0.3865 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA0440 




1.176 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA0440 




0.733 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA0559 




0,2355 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA0559 




0.3155 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA0751 




0.667 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA0751 




1.1525 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA0751 




3.4115 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA0751 




2.67 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA0858 




0.23 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA0858 




0.3835 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA0967 


1 


0.2555 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA0967 




0.1555 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA1095 




0.2225 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA1095 




0.328 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA1095 




0.2635 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


\ KIAA1095 




0.346^ 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA1222 


1 


0.3325 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA1222 




0.2375 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA1284 




0.8405 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA1284 




0.845 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA1415 




0.3215 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA1415 




0.3045 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA1526 




0.209 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA1526 




0.3675 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA1526 




3.8915 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA1526 




4 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA1526 


2 


0.8305 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA1526 


2 


1.511 




MnFRRAFKKII ARfinRkRIV/ 


Kl A A -1 cop* 


o 

c 


n soft's 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA1526 


2 


0.4095 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA1620 


1 


0.231 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA1620 


1 


0.152 


ALPHA>2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA1719 


1 


0.2835 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA1719 


1 


0.1895 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA1719 


2 


0.2545 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


KIAA1719 


2 


0.203 
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AVC NAME 


SEQUENCE 


^ Ik 1 ^ Ik. 1 A K A 1^ 

GENE NAME 


■~\ AM A 1 Ik 1 

DOMAIN 


AVERAGE OD 


Al ni—IA OA AO 

ALPnA-2A AR 


1 1 r~ o o A T~ ly \ \ ao/^ooi^oi\/ 

HDFRRAFKKILARGuRKRIV 


ly 1 A K A ^ A 

KIAA1719 


3 


0.338 


Al DLJA OA AD 

ALrnA-^A AK 


nUrKKArKKILAKCjUKKKIV 


iyiAA'1?«fO 

KIAA1 719 


3 


0.2555 


Al OLJA OA AO 

ALPHA-2A AK 


LJOI^OOAC^IXL^II AO OO !✓ O l \ / 

HDrRRArKKILARGDRKRIV 


lyiAAW^wrx 

KIAA1719 


4 


2.4485 


Al l~>l_IA OA AO 

ALPHA-2A AK 


1 1 r-\ r~ n o A r~ \y \y \ \ ao/^ooixoix/ 

HDFRRAFKKILARGjDRKRIV 


ly 1 A A r> 

KIAA1719 


4 


2.433 


Al ni_lA OA AO 

ALPHA-2A AK 


HDFRRAFKKILARGuRKRIV 


ly 1 A K A ~r A 

KIAA1719 


5 


0.417 


AIIOLJA OA AO 

ALPHA-2A AR 


i_j f— k r- [-> T-\ A r~ \y ly \ I AO o n ly n i \ / 

HDrRRAFKKILARGDRKRIV 


lyiAAH^-^f^ 

KIAA1719 


5 


0.356 


AlfOLJA OA A O 

ALPHA-2A AR 


1 1 r-\ r— [-> r*> A rr ly vy 1 1 ao/^ooi^oiv/ 

HDFRRAFKKILARGDRKRIV 


KIAA171 9 , 


6 


0.264 


Al OLJA OA AO 

ALPHA-2A AR 


l—l OCT D O a 1 1 AD/^ODU^OI\/ 

HDFRRAFKKILARGDRKRIV 


lyiA A A "7 A f\ 

KIAA1719 


6 


0.1695 


Al OLJA OA AO 

ALPnA-2A AR 


LJ OfT O O A d/' ly 1 1 Ao/^ooiyoiv/ 

HDFRRAFKKILARGDRKRIV 


1 IKil IlilV/OTIi^l IC? 

LIM MYSTIQUE 




0.8755 


Al OLJA OA AO 

ALPHA-2A AR 


11 1-\ r- o o A dy ly I i AOrf^ooiyoiv/ 

HDFRRAFKKILARGDRKRIV 


1 tKM ft JIV/OT'i A^l IP 

LIM MYSTIQUE 




0.8705 


Al OLJA OA AO 

ALPnA-2A AR 


Ljocroo A iriyixii a o/^ooiyoiv/ 

HDFRRAFKKILARGDRKRIV 


LIM PROTEIN 




0.5305 


Al nUA OA AO 

ALPHA-2A AR 


1 J o r" o o A c !✓ ly 1 1 a o o o iy o i \ / 

HDFRRAFKKILARGDRKRIV 


LIM PROTEIN 




0.732 


AIOLJA OA AO 

ALPHA-2A AR 


i_j o r o o A my !✓ 1 1 AO/^ooiyoi\/ 

HDFRRAFKKILARGDRKRIV 


1 in /I oil 

LIM-RIL 




0.407 


Al OLJA OA AO 

ALPHA-2A AR 


1 jr^r~no A i~iyiyii AO/^ooiyoi\/ 

HDFRRAFKKILARGDRKRIV 


LIM-RIL 




0.4955 


A 1 Ol J A OA AO 

ALPHA-2A AR 


I J o ^ o o A c ly ly i 1 A o o o ly o i\ / 

HDFRRAFKKILARGDRKRIV 


LIMK1 




0.354 


A 1 r~ll 1 A OA A lO 

ALPHA-2A AR 


1 1 r-\ r— n A ( — ly ly 1 1 AO^^ooiyoi\/ 

HDFRRAFKKILARGDRKRIV 


LIMK1 




0.3655 


A 1 r^l 1 A OA A lO 

ALPHA-2A AR 


1 1 r% ^ n A r~ ly ly 1 1 a r~» y> n ly n / 

HDFRRAFKKILARGDRKRIV 


LIMK2 




0.344 


A 1 OLJ A O A A O 

ALPHA-2A AR 


1 i i-\ p— n i-\ A ly ly 1 1 a o o o ly o i x / 

HDFRRAFKKILARGDRKRIV 


LIMK2 




0.4015 


Ai OLJA OA AO 

ALPHA-2A AR 


uorooAnyiyii Ao^^ooiyoi\/ 

HDFRRAFKKILARGDRKRIV 


LU-1 




0.2425 


A 1 1 I A OA A f*^ 

ALPHA-2A AR 


1 1 o f" o OAr"iyiyii ao r\ o ly o i \ / 

HDFRRAFKKILARGDRKRIV 


LU-1 




0.19 


A ■ 1 1 A OA A r~i 

ALPHA-2A AR 


1 1 n n a ly ly 1 1 a n n ly n i \ / 

HDFRRAFKKILARGDRKRIV 


MAGI 1 


^ 


0.247 


AIOLIA OA AO 

ALPHA-2A AR 


1 1 1 — n r~\ /» 1 — IX ly 1 1 AO r~\ n ly o i \ / 

HDFRRAFKKILARGDRKRIV 


MAGI 1 




0.365 


A 1 1 1 A ^ A A r*> 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 1 


^ 


0.3645 


A 1 1 1 A OA A 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 1 




0.4925 


A 1 r—t 1 1 A OA A 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 1 


3 


0.2915 


A ■ r^i 1 A OA A ro 

ALPHA-2A AR 


1 1 1~\ ^ 1—% A ^ ly ly 1 1 a i~\ ly lo i \ / 

HDFRRAFKKILARGDRKRIV 


MAGI 1 


3 


0.4715 


A 1 1 A OA A n 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 1 


3 


2.564 


A 1 m 1 A OA A 

ALPHA'2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 1 


3 


3.664 


A 1 ni 1 A OA A 

ALPHA-2A AR 


1 1 r~\ f~ n n a ^ ly ly ■ ■ a ly i \ / 

HDFRRAFKKILARGDRKRIV 


MAGI 1 


4 


0.3085 


A 1 n 1 1 A OA A 

ALPHA-2A AR 


1 1 ^ f \ A ^ ly ly ■ ■ a rN ly i \ / 

HDFRRAFKKILARGDRKRIV 


MAGI 1 


4 


0.4115 


A 1 m I A OA A 

ALPHA-2A AR 


1 1 ^ ^> A ^ ly ly ■ ■ a ly n i \ # 

HDFRRAFKKILARGDRKRIV 


MAGI 1 


5 


0.245 


A 1 n 1 1 A OA A n 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 1 


5 


0.3925 


A 1 0 1 ■ A OA A O 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 2 


1 


0.2595 


A 1 0 1 ■ A OA A n 

ALPHA-2A AR 


1 ■ ^ A t~ \y \y 1 1 a t~\ i~\ i~» ly i \ / 

HDFRRAFKKILARGDRKRIV 


MAGI 2 


1 


0.1815 


A i O i ■ A OA A r> 

ALPHA-2A AR 


1 ■ ^ i~> A ^ ly ly 1 1 a r*» y% i~» ly i \ / 

HDFRRAFKKILARGDRKRIV 


MAGI 2 


2 


0.205 


A 1 m 1 A OA A 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 2 


2 


0.136 


Al m 1 A OA AO 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 2 


3 


0.2925 


A 1 m 1 A OA A 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 2 


3 


0.1885 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 2 


4 


0.144 


A 1 r^i 1 A ^ A A r™k 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 2 


4 


0.18 


A I m 1 A OA A n 

ALPHA-2A AR 


1 1 1~\ ^ lo A ^ ty \y \ \ a t~\ ^\ i~k \y f~\ i \ i 

HDFRRAFKKILARGDRKRIV 


MAGI 2 


5 


0.7415 


A 1 1 1 A ^ A A r\ 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 2 


5 


0.8035 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 2 


6 


0.763 


A 1 1 1 A ^ A A ^\ 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 2 


6 


0.9055 


A 1 1 1 A ^ A A 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 3 


1 


0.272 


A 1 r\ 1 1 A A A 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 3 


1 


0.499 


A 1 1 i A ^ A A 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 3 


2 


0.701 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 3 


2 


1.192 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 3 


3 


0.243 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 3 


3 


0.566 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 3 


4 


0.2545 


ALPHA-2A AR 


HnFRRAFKKH ARf^DRKRIV 


MAr^i 

ivirAvji o 


4 
•t 


0.4775 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 3 


5 


0.2745 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAGI 3 


5 


0.5265 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAST1 


1 


0.4675 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAST1 


1 


0.355 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAST2 


1 


0.6125 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAST2 


1 


0.5255 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MAST2 




0.98 
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AVU iNAivIt 


SEQUENCE 1 GENE NAME 


DOMAIN 


AVERAGE OD 


Al DUIA OA AD 

ALrnA-^A AK 


HDFRRAFKKILARGDRKRIV 


MAST2 




1.7505 


Al DUIA OA AD 
MLnr1M-ZA Ar\ 


HDFRRAFKKILARGDRKRIV 


MAST4 


A 

1 


0.264 


Al Dl-IA OA AD 

ALnriA-^A AK 


HDFRRAFKKILARGDRKRIV 


MAST4 


1 


0.3355 


Al DUIA OA AD 

ALrnA-ZA AK 


HDFRRAFKKILARGDRKRIV 


MINT1 


1 


1 .0045 


Al DUIA OA AD 

ALPMA-ZA AK 


HDFRRAFKKILARGDRKRIV 


MINT1 


1 


0.781 


Al DUIA OA AD 

ALrnA-4iA AK 


HDFRRAFKKILARGDRKRIV 


MINT1 


2 


0.299 


Al DUIA OA AD 

ALKnA-iilA AK 


HDFRRAFKKILARGDRKRIV 


MINT1 


o 

z 


0.1895 


Al DUIA OA AD 

ALKMA-ZA AK 


HDFRRAFKKILARGDRKRIV 


MINT1 


1 o 


3.184 


Al DLJA OA AD 

ALrrlA-zA AK 


HDFRRAFKKILARGDRKRIV 


MINT1 


A O 

1.2 


3.8385 


Al DLJA OA AD 

ALrHA-ziA AK 


HDFRRAFKKILARGDRKRIV 


MPP1 


1 


0.479 


Al DUIA OA AD 

ALrnA-zA AK 


HDFRRAFKKILARGDRKRIV 


MPP1 


1 


0.685 


Al DUIA OA AD 

ALPHA-^A AK 


HDFRRAFKKILARGDRKRIV 


MPP2 


A 
1 


0.464 


Al DUIA OA AD 

ALKnA-ZA AK 


HDFRRAFKKILARGDRKRIV 


MPP2 


A 

\ 


0.318 


Al DLJA OA AD 

ALrHA-2A AK 


HDFRRAFKKILARGDRKRIV 


MUPP1 


1 


0.4445 


Al DLJA OA AO 

ALPHA-2A AK 


HDFRRAFKKILARGDRKRIV 


MUPP1 


1 


0.7405 


Al DLJA OA AO 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MUPP1 


2 


0.4995 


Al DLJA OA AO 

ALPnA-2A AR 


HDFRRAFKKILARGDRKRIV 


MUPP1 


o 

2 


0.5935 


Al DLJA OA AD 

ALPnA-2A AR 


HDFRRAFKKILARGDRKRIV 


MUPP1 


3 


0.4815 


Al DLJA OA AD 

ALPrlA-2A AR 


HDFRRAFKKILARGDRKRIV 


MUPP1 


o 

3 


0.742 


Al DLJA OA AD 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MUPP1 


4 


1.08 


Al DLJA OA AD 

ALPnA-2A AR 


HDFRRAFKKILARGDRKRIV 


MUPP1 


4 


1.923 


Al DUIA OA AD 

ALPnA-2A AK 


HDFRRAFKKILARGDRKRIV 


MUPP1 


c 

o 


0.3005 


Al DLJA OA AD 

ALPnA-2A AK 


HDFRRAFKKILARGDRKRIV 


MUPP1 


5 


0.706 


Al DUIA OA AD 

ALPMA-ZA AK 


HDFRRAFKKILARGDRKRIV 


MUPP1 


D 


1 .1 875 


Al DLJA OA AD 

ALPnA-2A AK 


HDFRRAFKKILARGDRKRIV 


MUPP1 


6 


1.909 


Al DUIA OA AD 

ALPMA-ZA AK 


HDFRRAFKKILARGDRKRIV 


MUPP1 


7 


0.377 


Al DLJA OA AD 

ALPHA-ZA AK 


HDFRRAFKKILARGDRKRIV 


MUPP1 


7 


0.676 


Al DI—IA OA AD 

ALPMA-ZA AK 


HDFRRAFKKILARGDRKRIV 


MUPP1 


8 


0.835 


Al DLJA OA AD 

ALPMA-ZA AK 


HDFRRAFKKILARGDRKRIV 


MUPP1 


o 
O 


1 .5405 


Al DLJA OA AD 

ALPHA-ZA AK 


HDFRRAFKKILARGDRKRIV 


MUPP1 


9 


0.2845 


AIDLJA OA AD 

ALPHA-ZA AK 


HDFRRAFKKILARGDRKRIV 


MUPP1 


9 


0.5165 


AiOLJA OA AO 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MUPP1 


10 


0.3165 


A 1 1 1 A OA A 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MUPP1 


10 


0.514 


A 1 DLJ A OA AO 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MUPP1 


1 1 


0.309 


A 1 Ol 1 A OA A n 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MUPP1 


1 1 


0.6785 


A 1 OLJ A O A AO 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MUPP1 


12 


0.23 


Al OLJA OA AO 

ALPMA-2A AR 


HDFRRAFKKILARGDRKRIV 


MUPP1 


12 


0.3145 


AIOLJA OA A O 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MUPP1 


13 


0.5555 


A 1 0 1 1 A OA A f* 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


MUPP1 


13 


0.842 


At DI—IA OA AO 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


NEDLG 


1 


0.2175 


Al DI—IA OA AO 

ALPnA-2A AR 


HDFRRAFKKILARGDRKRIV 


NEDLG 


1 


0.143 


Al DLJA OA AD 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


NEDLG 


2 


0.159 


Al DLJA OA AD 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


NEDLG 


o 

2 


0.2355 


Al DLJA OA AD 

ALPnA-2A AR 


HDFRRAFKKILARGDRKRIV 


NEDLG 


o 

3 


0.137 


Al DLJA OA AD 

ALPMA-ZA AK 


HDFRRAFKKILARGDRKRIV 


NEDLG 


3 


0.2555 


Al DUIA OA AD 

ALPMA-ZA AK 


HDFRRAFKKILARGDRKRIV 


NEDLG 


A O 

1 12 


0.3165 


Al DUIA OA AD 

ALPMA-ZA AK 


HDFRRAFKKILARGDRKRIV 


NEDLG 


A O 

' .2 


0.401 


Al DLJA OA AD 

ALPMA-ZA AK 


HDFRRAFKKILARGDRKRIV 


NOS1 


A 
1 


0.7285 


Al DLJA OA AD 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


NOS1 


1 


0.96 


Al DLJA OA AO 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


NOVEL PDZ GENE 


1 


0.8105 


Al Di-i A OA AO 

ALPHA-2A AR 


i_i r-v r o o A ^ \y w A o r\ o o 1 \ / 

HDFRRArKKILARGDRKRIV 


NOVEL PDZ GENE 


1 


2.973 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


NOVEL PDZ GENE 


2 


0.363 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


NOVEL PDZ GENE 


2 


0.844 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


OUTER MEMBRANE 


1 


0.21 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


OUTER MEMBRANE 


1 


0.4655 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


P55T 


1 


0.236 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


P55T 


1 


0.1785 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


PAR3 


2 


0.2675 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


PAR3 


2 


0.2085 
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AVC NAME 


SEQUtNCt 


GENE NAME 


ft il A 1 K 1 

DOMAIN 


AVERAGE OD 


Al OLl A OA AO 

ALPrlA-2A AR 


l_l D^D D A !✓ 1 1 A D DD ly D 1 V / 

HDFRRAFKKILARGDRKRIV 


PAR3 


3 


1.451 


Al Dt-J A OA AD 

ALrHA-zA AK 


l—l D^ D D A ^ ly 1/ 1 1 AD/^DDl^DIV/ 

rilJrKKArr\t\ILAKCjiJKKKIV 


PARS 


3 


1.2735 


Al Dt-IA OA AD 

ALPnA-2A AR 


LJ D C D D A IX 1 1 ADODDIXDI\/ 

HDFRRAr KKILARCjDRKRIV 


PARS 


1 


0.381 


Al r>l_IA OA AD 

ALrrlA-^A AK 


l_J l~\ d D D A Cr !✓ 11 ADODDIXDIX/ 

HDrKRArKKILARouRKKIV 


D A D £5 

PARd 


1 


0.568 


AiOLJA OA AD 

ALPHA-2A AR 


HuFRRAFKKILARCjDRKRIV 


DADO A ^ /I h Jl A 

PARd gamma 


1 


0.2065 


Al DI—IA OA AD 

ALrriA-2A AK 


LJ D IT D D A criX IXI 1 A D /"» r\D D l\ / 

HUFRRAFKKILARC3DRKRIV 


DADO A R /I ft /I A 

PARd gamma 


1 


0.2425 


Al DLJA OA AD 

ALrnA-2A AR 


LJ D C D D A in/' IX 1 1 AD/^DDIXDI\/ 

HDFRRAFKKILARvjDRKRI V 


PDZ-73 


2 


0.251 


A 1 Dl—I A O A A D 

ALPHA-2A AR 


1 1 CDDAmXlXII A O D D IX D l \ / 

HDFRRAFKKILAROjDRKRIV 


PDZ-73 


2 


0.4365 


Al DI-IA OA AD 

ALPHA-2A AR 


LJf~\CrDD A iriXIXII ADODDIXDIV/ 

HDFRRAFKKILARoDRKRIV 


D D^ ^0 

PDZ-73 


0 
3 


0.2225 


Al DLJA OA AD 

ALPHA-2A AR 


LJ D^DD A ^IXIXII A D/^ r^D IXD l\/ 

HDFRRAFKKILARGDRKRIV 


DD^ TO 

PDZ-73 


3 


0.369 


Al nU A OA AD 

ALPHA-2A AR 


LJ D(^DD A CTIXIXII AD/^DDIXDH/ 

HDFRRAFKKILARGDRKRIV 


PDZK1 


1 


0.3415 


Al DLJA OA AD 

ALPHA-2A AR 


l_m^DD A ^IXIXII AD^nDIXDK/ 

HDFRRAFKKILARGDRKRIV 


PDZK1 


1 


0.608 


Al OI_IA OA AD 

ALPHA-2A AR 


LJ D d D D A nx IX 1 1 A D n\ D IX D 1 \ / 

HDFRRAFKKILARGDRKRIV 


DD'7IX H 

PDZK1 


2 


0.29 


Al DLJA OA AD 

ALPHA-2A AR 


1 1 r-k ^nDACTIXiXII AD D D IX D 1 \ / 

HDFRRAFKKILARGDRKRIV 


PDZK1 


2 


0.4915 


Al DLJA OA AD 

ALPnA-2A AR 


LJ D tTD D A nX IXI 1 AD/^DDIXDI\/ 

HDFRRAFKKILARGDRKRIV 


PDZK1 


0 
3 


0.5655 


Al DLJA OA AD 

ALPnA-2A AR 


LJ D [TD D A ^IX IXI 1 AD^^DDIXDIV/ 

HDFRRAFKKILARGDRKRIV 


PDZK1 


0 
3 


0.5355 


Al DLJA OA AD 

ALPnA-2A AR 


LJ r^^D D A ^IX IX 1 1 AD/^DDIXDI1# 

HDFRRAFKKILARGDRKRIV 


PDZK1 


4 


0.199 


Al DLJA OA AD 

ALPnA-2A AR 


LJ r^^D D A ^IXIXI 1 AD/^DDIXDIX/ 

HDFRRAFKKILARGDRKRIV 


DDTIX4 

PDZK1 


4 


0.2365 


Al DLJA OA AD 

ALPnA-2A AR 


LJr^CDD A CriXtXIl A D/^ r^D IXO IV/ 

HDFRRAFKKILARGDRKRIV 


PDZK1 


00^ 
2,3,4 


0.441 


AIDLJA OA AD 

ALPHA-2A AR 


l_l D ^ D D A nx IX 1 1 AD/^DDIXDIX/ 

HDFRRAFKKILARGDRKRIV 


PDZK1 


0 0 it 
2,3,4 


0.51 15 


Al DLJA OA AD 

ALPHA-2A AR 


LJ D^D D A CriXIXI 1 A D DD IX D 1\ / 

HDFRRAFKKILARGDRKRIV 


PICK1 




0.535 


Al DLJA OA AD 

ALPHA-2A AR 


LJ D C D D A mXlX 1 1 AD/^DDIXDI\/ 

HDFRRAFKKILARGDRKRIV 


PICK1 




0.769 


A 1 Dl 1 A OA AD 

ALPHA-2A AR 


1 1 I~DDAr~lXIXII AD D D IX D 1 \ / 

HDFRRAFKKILARGDRKRIV 


PIST 




0.144 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


PIST 




0.35 


A 1 DU A OA AD 

ALPHA-2A AR 


LJ D ITDDACriXlXII AD D D IX D 1 V / 

HDFRRAFKKILARGDRKRIV 


PRIL16 




0.3105 


AIDLJA OA A D 

ALPHA-2A AR 


1 i 1— » ^railAI^IXIXII AD t~\ D IX D l \ / 

HDFRRAFKKILARGDRKRIV 


PRIL16 




0.292 


A 1 D LJ A OA A D 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


PRIL16 


2 


0.2165 


AIDllA OA AD 

ALPHA-2A AR 


L 1 D ^DDA^IXIXII AD /"» D D IX D 1 V / 

HDFRRAFKKILARGDRKRIV 


PRIL16 


2 


0.173 


A 1 1 i A OA A 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


PRIL16 


1 ,2 


0.5495 


A 1 n 1 1 A OA A 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


PRIL16 


1.2 


0.6355 


A 1 1 1 A OA A 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


PSD95 


1 


0.2165 


AIDIIA OA AD 

ALPHA-2A AR 


1 1 D ^DDA^IXIXM AD ^ D D IX D 1 V / 

HDFRRAFKKILARGDRKRIV 


PSD95 


1 


0.1 15 


A ■ m i A o A A 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


PSD95 


3 


0.161 


A 1 r>l 1 A ^ A A 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


PSD95 


3 


0.1085 


Air^iiA OA A 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


PSD95 


1.2,3 


0.341 


A 1 n I 1 A o A A 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


PSD95 


1.2,3 


0.4045 


Al DLJA OA AD 

ALPHA-2A AR 


LJ D r D D A IXIXII A D r^D IXD l\ / 

HDFRRAFKKILARGDRKRIV 


PTN-4 


1 


0.384 


Al DLJA OA AD 

ALPHA-2A AR 


LJ r^^DD A dlXIXIl AD/^DDIXDIV/ 

HDFRRAFKKILARGDRKRIV 


DTM A 

PTN-4 


1 


0.425 


AIDLJA OA AD 

ALPHA-2A AR 


LJ D D D A ITIX IX 1 1 AD/^DDIXDI\/ 

HDFRRAFKKILARGDRKRIV 


PTPL1 


1 


0.2225 


Al DLJA OA AD 

ALPHA-2A AR 


LJ D tr D D A mX IX 11 AD/^DDIXDI\/ 

HDFRRAFKKILARGDRKRIV 


PTPL1 


1 


0.163 


Al DLJA OA AD 

ALPnA-2A AK 


LJ D CTD D A nx IXI 1 AD/^DDIXDI\/ 

HDFRRAFKKILARGDRKRIV 


DTDl H 

PTPL1 


0 

2 


1 .6145 


Al DLJA OA AD 

ALPnA-2A AR 


1—1 rXCD D A CIX IXI 1 AD/^DDIXDI\/ 

HDrKKArKKILARGDRKRIV 


DTDI H 

PTPL1 


0 
2 


4 A CO 

1.452 


Al DI-IA OA AD 

ALKnA-2A AK 


l_mCDD A CIXIXII AD/^nDIXDI\/ 

HDFRKArKKILARGDRKRIV 


PTPL1 


0 
0 


0.1595 


Al DLJA OA AD 

ALPHA-2A AR 


LJ D^D D A ^IXIXI 1 A D X^ D IXD 1 V / 

HDFRRAFKKILARGDRKRIV 


PTPL1 


3 


0.1905 


Al DLJA OA AD 

ALPHA-2A AR 


LJ r\PD D A dX IXI 1 ADX^DDIXDIX/ 

HDFRRAFKKILARGDRKRIV 


PTPL1 


4 


0.265 


A 1 Dl_l A OA AD 

ALPHA-2A AR 


LJ DCTD D A CriX IX 1 1 AD/^DDIXDi\/ 

HDFRRAFKKILARGDRKRIV 


PTPL1 


4 


0.4135 


Al DLJA OA AD 

ALPHA-2A AR 


l—l D 1^ D D A dX IX 1 1 AD/^DDIXDI\/ 

HDFRRAFKKILARGDRKRIV 


PTPL1 


*- 
5 


0.1895 


Al DLJA OA AD 

ALPHA-2A AR 


LJ D tr D D A IX IX 1 1 AD/^DDlXDi\/ 

HDFRRAFKKILARGDRKRIV 


PTPL1 


5 


0.3 


AIDLJA OA AD 

ALPHA-2A AR 


1 1 fTDDAriXlXII A D D D IX D 1 \ / 

HDFRRAFKKILARGDRKRIV 


SHANK 1 


1 


r\ 0 0 4 

0.281 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


SHANK 1 


1 


0.2205 


A 1 1 1 A ^ A A 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


SIP1 


2 


0.332 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


SIP1 


2 


0.205 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


SITAC 18 


1 


3.8915 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


SITAC 18 


1 


3.297 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


SITAC 18 


2 


3.8365 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


SITAC 18 


2 


4 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


SYNTROPHIN 1 ALPHA 


1 


1.11 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


SYNTROPHIN 1 ALPHA 


1 


1.78 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


SYNTROPHIN GAMMA 1 


1 


1.2706 
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AVC NAME 


SEQUENCE 


GENE NAME 


UUMAIN 


A\/CDA^C /^D 

AVbKAGb UD 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


0\/K ITD /^ DLJ 1 Kl /^AlV/IRflA A 

SYNTROPHIN oAMIVIA 1 


\ 


A 'i 0£S 

1.126 


ALPHA-2A AR 


l_J 1~\ C D D A C 1 1 A D l~\ D !✓ D l\ / 

HDFRRAFKKILARGDRKRIV 


oYN 1 RUHHIN GAMMA Z. 


\ 


r\ oec 
U.2DO 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


oYN 1 RUr'HIN CjAMMA Z 





O.IOD 


A 1 n 1 1 A OA A 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


1 AX^-LIKb rKU I bIN 


\ 




A 1 m 1 A OA A n 

ALPHA-2A AR 


LJ fro D A riv' !✓ 11 A D r^D IX D 1 \ / 

HDFRRAFKKILARGDRKRIV 


1 AAZ-LIKb rKU 1 bllN 


\ 


n ceo 


A 1 r^i 1 A OA A 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


Tl A Ail 4 

TIAM1 


\ 


o >i >t c 
O.o44o 


A 1 1 A O A A 

ALPHA-2A AR 


1 1 r> f~ D n A ^ !✓ ly 1 1 ad ^ d d !✓ d i \ / 

HDFRRAFKKILARGDRKRIV 


~ri A hJi'i 

1 IAM1 


J 


0.435 


A 1 m 1 A OA A r~% 

ALPHA-2A AR 


U r" D D ACTIXIXII AD D D !✓ D 1 \ / 

HDFRRAFKKILARGDRKRIV 


1 lAM^ 


J 


U.Z440 


A 1 r\ 1 i A O A A 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


Tl A K ilO 

TIAM2 


\ — — 


0.378 


A 1 m 1 A OA A 

ALPHA-2A AR 


HDFRRAFKKILARGQRKRIV 


TIP 1 


\ 


O.oOz 


A ■ 1 A O A A 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


TID** 
TIP1 


J 


1.309 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


TIDO 

TIP2 


] 


0.4165 


ALPHA-2A AR 


1 1 w~ fO A ^ ix 1 1 A lo lo IV/ 

HDFRRAFKKILARGDRKRIV 


TIP2 


\ 


rv cncc 

0.6065 


ALPHA-2A AR 


1 1 •"v ^ lO A ^ !✓ 1 1 A fO lO D 1 V / 

HDFRRAFKKILARGDRKRIV 


\ / A DTI 1 1 

VARTUL 


J 


O.Zo/, 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


V / A DTI 1 1 

VARTUL 




r» -1 coc 
0.1 OZO 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


\ / A DTI II 

VARTUL 


o 

2 


o OOOC 

0.3335 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


VARTUL 


2 


0.2375 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


\ / A DTI II 

VARTUL 


3 


0.2985 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


VARTUL 


o 

3 


fy t ooc 

0.1235 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


V / A DTI 1 1 

VARTUL 


4 


U.oOz 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


\ / A DTI 1 1 

VARTUL 


4 


O.loUO 


ALPHA-2A AR 


i 1 ff~ ^\ A ^ IV IV 11 A ^\ ^\ IV 1 \ / 

HDFRRAFKKILARGDRKRIV 


\ / A DTI 1 1 

VARTUL 


4 O 

1 .z 


U.ODOO 


ALPHA-2A AR 


1 1 ^ A ^ IV IV 1 1 A v^ iv i v / 

HDFRRAFKKILARGDRKRIV 


\ / A DTI 1 1 

VARTUL 


O 


n CAR 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


V <4 <l D^TA 

X-11 BETA 


1 


1 .o4o5 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


V H DCTA 

A-11 BETA 


•1 


1 .O^DO 


A 1 n 1 1 A OA A f\ 

ALPHA-2A AR 


1 1 r-N r— r~\ T~\ A X~\y\y\\ AD/^DDl^DIV/ 

HDFRRAFKKILARGDRKRIV 


Y ■I •i QCXA 

A-1 1 Dt 1 A 


o 

z 


u.ozuo 


A 1 rt 1 1 A OA A 

ALPHA-2A AR 


l_l D C D D A CTL^ l/"! 1 A D 1^ D D 1 \ / 

HDFRRAFKKILARCjUKKKIV 


A-1 1 bfc 1 A 


o 




A 1 ■ 1 A OA A 

ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


A- 1 1 bfc 1 A 




Z.Oo 


A 1 1 • A OA A t~\ 

ALPHA-2A AR 


l_J D IT D D A ly 1 1 AD/^DDU'DIV/ 

HDFRRAFKKILARGDRKRIV 


V -1 -1 OCT A 

A-1 1 DC 1 A 


1 ,z 


o.Dyoo 


A 1 n I ■ A o A A 

ALPHA-2A AR 


LJ D ^D D A CriX IX 1 1 ADODDl^DI\/ 

HDrKKArKKILAKGUKKKIV 




1 
1 


O. f oo 


Al m_l A OA AD 

ALPnA-2A AR 




LXJ- \ 


o 




Al Ol_IA OA AD 

ALPrlA-2A AR 


HUrKKArr\l\ILAKoUKI\KI V 


7C\ -1 


o 




AinLJA OA AD 

ALPHA-2A AR 


l_l D r D D A d/ !✓ 1 1 A D /"* DD l/^D l\ / 

HDrRRArKKILAKGUKKKIV 


lX)-\ 




u.oouo 


A 1 r*t ■ 1 A OA A r*i 

ALPHA-2A AR 


LI D rr D D A d/ lyi 1 adoddi/di\/ 

HDFRRAFKKILARGDRKRIV 


^lU- 1 


o 
o 


u. / ooo 


A 1 1 ■ A ^\ A A 

ALPHA-2A AR 


U D D D A ny ly 1 1 AD^DDL/DIX/ 

HDFRRAFKKILARGDRKRIV 


Z.L;-id 




U.DDOO 


A 1 ^% 1 ■ A ^\ A A 

ALPHA-2A AR 


I 1 D 1^ D D A ny 1 1 AD/^DDIXDIV/ 

HDFRRAFKKILARGDRKRIV 


ZO-Z 


1 


u.4uyo 


A 1 ■ ■ A A A 

ALPHA-2A AR 


1 1 f— |— » |-« A ^ !✓ !✓ 1 1 AD/^DDIXDH/ 

HDFRRAFKKILARGDRKRIV 


~jr\ o 
ZO-2 


o 

z 


1 .0/ r 0 


A 1 lO 1 1 A O A A 

ALPHA-2A AR 


1 1 D r D D A m/l/l 1 AD/^DDI/DIV/ 

HDFRRAFKKILARGDRKRIV 


ZO-2 


o 
£. 


1 .OoOO 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


ZO-2 


3 


0.1415 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


ZO-2 


3 


0.2935 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


ZO-3 


1 


0.578 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


ZO-3 


1 


0.746 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


ZO-3 


2 


2.5585 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


ZO-3 


2 


3.245 


ALPHA-2A AR 


HDFRRAFKKILARGDRKRIV 


ZO-3 


3 


0.2365 


alpha-2A AR 


HDFRRAFKKILARGDRKRIV 


ZO-3 


3 


0.4715 



Table 8B: 



AVC NAME 


SEQUENCE 


GENE NAME 


DOMAIN 


AVERAGE OD 


alpha-2B AR 


QDFRRAFRRILARP\An-QTAW 


AF6 




1.986 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


AF6 




2.387 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


AF6 




2.232 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


AlPC 




1.5395 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


AlPC 




0.576 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


AlPC 




1.02€ 
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AVC NAME 


SEQUENCE 


GENE NAME 


DOMAIN 


AVERAGE CD 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


' AlPC 


1 


1.7515 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


AlPC 


3 


0.404 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


AlPC 


3 


0.788 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


AlPC 


4 


1.117 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


AlPC 


4 


0.50E 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


ALP 


1 


0.952 


aipha-2B AR 


QDFRRAFRRILARPWTQTAW 


ALP 




1.337e 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


APXL1 


1 


2.00£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


APXL1 


1 


0.97c 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


CARD14 


1 


1.878£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


CARD14 


1 


1.144 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


CASK 


1 


2.224£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


CASK 


1 


1.905 


aIpha-2B AR 


QDFRRAFRRILARPWTQTAW 


CASK 


1 


2.13S 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


CNK1 


1 


1.3535 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


CNK1 


1 


0.8095 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Cytohesin binding Protein 


1 


1.96E 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Cytohesin binding Protein 


1 


2.1155 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Cytohesin binding Protein 


1 


1.878 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


DLG1 


1 


1.4£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


DLG1 


1 


0.93£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


DLG1 


2 


1.597 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


DLG1 


2 


1.1225 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


DLG1 


3 


1.14 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


DLG1 


3 


2.0895 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


DLG1 


1,2 


2.082 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


DLG1 


' 1.2 


2.4735 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


DLG1 


1.2 


2.1545 


a!pha-2B AR 


QDFRRAFRRILARPWTQTAW 


DLG2 


1 


0.6645 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


DLG2 


1 


0.885 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


DLG2 


2 


0.7655 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


DLG2 


2 


1 .3695 


a!pha-2B AR 


QDFRRAFRRILARPWTQTAW 


DLG5 


1 


1 .0645 


alpha--2B AR 


QDFRRAFRRILARPWTQTAW 


DLG5 


1 


0.6255 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


DLG5 


2 


2.2525 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


DLG5 


2 


2.822 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


DLG5 


2 


2.4085 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


DLG5 


2 


1.1375 


aIpha-2B AR 


QDFRRAFRRILARPWTQTAW 


DLG5 


2 


0.568 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


DVL2 


^ 


1.1125 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


DVL2 




1.962 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


DVL3 




2.5155 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


DVL3 




2.0525 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


EBP50 




0.7175 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


EBP50 




1.3475 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


EBP50 


2 


0.6575 
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.AVC NAME 


SEQUENCE 


GENE NAME 


DOMAIN 


AVERAGE CD 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


EBP50 


2 


1.1^ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


EBP50 


1.2 


1.1^ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


EBP50 


1.2 


0.603£ 


aIpha-2B AR 


QDFRRAFRRILARPWTQTAW 


ENIGMA 


1 


0.849£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


ENIGMA 


1 


1.5175 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


ERBIN 


1 


0.783e 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


ERBIN 


1 


1.4045 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


FLJ00011 


1 


0.6075 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


FLJ00011 


1 


1.2535 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


FLJ11215 




1.1605 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


FLJ11215 


1 


0.5095 


atpha-2B AR 


QDFRRAFRRILARPWTQTAW 


FLJ12615 


1 


0.5005 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


FLJ12615 


1 


1.012 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


FLJ21687 


1 


1.204 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


FLJ21687 


1 


0.62E 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


GRIP 1 


4 


0.5325 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


GRIP 1 


4 


2.5575 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


GRIP 1 


5 


0.6365 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


GRIP 1 


5 


0.9375 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


GRIP 1 


6 


1.51£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


GRIP 1 


6 


0.992 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


GRIP 1 


7 


0,7745 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


GRIP 1 


7 


. 0.8€ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Guanine exchange factor 


1 


0.56 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Guanine exchange factor 


1 


1.2065 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


HEMBA 1003117 


1 


1.3575 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


HEMBA 1003117 


1 


0.546 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


HEMBA 1003117 


1 


0.7805 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


HEMBA 1003117 


1 


1.432 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


INADL 


1 


i.i9e 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


INADL 


1 


1.2095 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


INADL 


2 


1.2635 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


INADL 


2 


1.2545 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


INADL 


3 


2.2165 


a!pha-2B AR 


QDFRRAFRRILARPWTQTAW 


INADL 


3 


1.3695 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


INADL 


4 


1.79c 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


INADL 


4 


1.582 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


INADL 


5 


2.16S 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


INADL 


5 


1.646 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


INADL 


7 


1.925 


atpha-2B AR 


QDFRRAFRRILARPWTQTAW 


INADL 


7 


1.331 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


INADL 


8 


2.6575 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


INADL 


a 


2.4085 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA0316 


1 


1.1905 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA0316 


1 


0.6525 


atpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA0340 


1 


o.6oe 
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AVC NAME 


SEQUENCE 


GENE NAME 


DOMAIN 


AVERAGE CD 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA0340 




1.175 


a!pha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA0380 




2.442 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA0380 


1 


1.891£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA0380 




2.731 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA0382 




0.574£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA0382 




1.117£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA0440 




2.671 £ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA0440 




1.761£ 


a]pha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA0440 




2.981£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA0559 




1.381£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA0559 




1.677 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA0751 




. 1.693£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA0751 




2.147E 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA0751 




1.485 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA0858 




1.7685 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA0858 




1.134 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA0967 




0.504 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA0967 




0.86S 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1095 




1.5 


a!pha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1095 




0.8115 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1222 




0.9555 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1222 




0.57 


a!pha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1284 




0.5985 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1284 


1 


1.537 


a!pha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1415 




0.598 


aIpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1415 




2.3885 


a1pha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1526 




0.6885 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1526 




1.462 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1526 




1.3295 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1526 


1 


0.931 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1526 


2 


0.6855 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1526 


2 


1,6875 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1526 


2 


0.804 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1526 


2 


0.534 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1620 


1 


0.575 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1620 


1 


1.9325 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1719 


1 


0.6145 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1719 


1 


1.5 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1719 


2 


1.448 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1719 


2 


0.5935 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1719 


3 


3.5805 


a!pha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1719 


3 


2.3ie 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1719 


4 


0.522 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1719 


4 


1.094 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1719 


5 


0.6855 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1719 


5 


1.6365 
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AVC NAME 


SEQUENCE 


GENE NAME 


DOMAIN 


AVERAGE CD 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1719 


6 


0.8£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


KIAA1719 


6 


1.713 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


LIM mystique 


1 


0.6555 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


LIM mystique 




o.7oe 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


LIM protein 


1 


1.847 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


LIM protein 


1- 


2.10C 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


LIM-RIL 


1 


1.9115 


aipha-2B AR 


QDFRRAFRRILARPWTQTAW 


LIM-RIL 


1 


1.1165 


alpha*2B AR 


QDFRRAFRRILARPWTQTAW 


LIMK1 


1 


1.6515 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


LIMK1 


1 


1.7335 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


LIMK2 


1 


2.96c 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


LIMK2 


1 


2.19e 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


LU-1 


1 


0.71€ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


LU-I 


1 


0.6275 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAGI 1 


1 


1.5685 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAGI 1 


1 


0.9585 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAGI 1 


1 


3.5185 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAGI 1 


1 


3.231 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAGI 1 


3 


1.862 


a!pha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAGI 1 


3 


1.2295 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAGI 1 


3 


1 .4925 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAGI 1 


3 


1.6005 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAGI 1 


4 


A 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAGI 1 


4 


A 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


. MAGI 1 


5 


1.267 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAGI 1 


5 


1.295 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Magi 2 


1 


0.755 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Magi 2 


1 


1.3725 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Magi 2 


2 


0.50€ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Magi 2 


2 


0.8235 


aipha-2B AR 


QDFRRAFRRILARPWTQTAW 


Magi 2 


3 


2.22€ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Magi 2 


3 


2.92 


a)pha-2B AR 


QDFRRAFRRILARPWTQTAW 


Magi 2 


4 


0.45 


a!pha-2B AR 


QDFRRAFRRILARPWTQTAW 


Magi 2 


4 


0.9925 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Magi 2 


5 


1.9195 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Magi 2 


5 


0.772 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Magi 2 


6 


1.487 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Magi 2 


6 


0.555 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAGI 3 


1 


1.8545 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAGI 3 


1 


2.57e 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAGI 3 


2 


2,0285 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAGI 3 


2 


0.1245 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAGI 3 


3 


2.02 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAGI 3 


3 


1.34€ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAGI 3 


4 


1.212 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAGI 3 


4 


1.7545 
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AVC NAME 


SEQUENCE 


GENE NAME 


DOMAIN 


AVERAGE CD 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAGI 3 


5 


2.17^ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAGI 3 


5 


1.447 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAST1 


1 


i.85e 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAST1 




1.559£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAST2 


1 


4.051 £ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAST2 


1 


2.495£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAST4 


1 


3.97 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAST4 


1 


2.581 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MAST4 


1 


A 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MINT1 




1.561£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MINT1 


1 


0.8725 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MINT1 


2 


1.3535 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MINT1 


2 


0.848£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MINT1 


1,2 


2.74^ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MINT1 


1.2 


3.08^ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MPP1 


1 


2.2ie 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MPP1 


1 


2.2205 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MPP2 


1 


3.5385 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MPP2 


1 


2.4015 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


1 


C 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


1 


3.9855 


aIpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


2 


C 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


2 


3.77^ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


3 


3.9815 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


3 


C 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


4 


3.8085 


a!pha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


4 


C 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


5 


3.9975 


a!pha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


5 


C 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


6 


C 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


6 


3.92C 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


7 


C 


atpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


7 


A 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


8 


C 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


8 


3.818 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


9 


C 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


9 


A 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


10 


0.967 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


10 


2.152 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


11 


0.57c 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


11 


1.192 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


12 


0.62c 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


12 


1.173 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


13 


1.0195 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


MUPP1 


13 


2.4275 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


NeDLG 


1 


1.1145 
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AVC NAME 


SEQUENCE 


GENE NAME 


DOMAIN 


AVERAGE CD 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


NeDLG 


1 


1.95c 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


NeDLG 


2 


1.564£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


NeDLG 


2 


0.934£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


NeDLG 


3 


3.53^ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


NeDLG 


3 


3.826£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


NeDLG 


1.2 


2.989£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


NeDLG 


1.2 


2.448£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


NOS1 


1 


3.5405 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


NOS1 


1 


2.515 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


novel PDZ gene 


1 


1.7425 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


novel PDZ gene 


1 


1.19c 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


novel PDZ gene 


2 


2.1985 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


novel PDZ gene 


2 


1.4345 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


outer membrane 


1 


0.6£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


outer membrane 


1 


1.312 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


p65T 


1 


0.488 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


p55T 


1 


0.8315 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PAR3 


3 


1.39e 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PAR3 


3 


0.597 


a(pha-2B AR 


QDFRRAFRRILARPWTQTAW 


PAR6 


1 


i.6ie 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PAR6 


1 


2.27£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PAR6 GAMMA 


1 


0.3865 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PAR6 GAMMA 


1 


0.91^ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PDZ-73 


2 


2.112 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PDZ-73 


2 


1.3175 


aIpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PDZK1 


1 


0.7315 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PDZK1 


1 


1.502 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PDZK1 


2 


1.5125 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PDZK1 


2 


2.7415 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PDZK1 


3 


0.72e 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PDZK1 


3 


1.374 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PDZK1 


4 


0.82e 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PDZK1 


4 


1.361 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PDZK1 


2,3,4 


2.1345 


aipha-2B AR 


QDFRRAFRRILARPWTQTAW 


PDZK1 


2,3,4 


2.597 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PICK1 


1 


2.45€ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PICK1 


1 • 


1.3835 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PICK1 


1 


0.6615 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PICK1 


1 


1.4495 


a!pha-2B AR 


QDFRRAFRRILARPWTQTAW 


PIST 


1 


1.502 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PIST 




0.587 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


prlL16 




0.9865 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


prlL16 




0.47-^ 


aIpha-2B AR 


QDFRRAFRRILARPWTQTAW 


prlL16 


2 


0.4355 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


prILie 


2 


0.897 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


prlL16 


1.2 


2.0705 
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AVC NAME 


SEQUENCE 


GENE NAME 


DOMAIN 


AVERAGE OD 


alpha-2B AR 


QDFRRAFRRILARP\An"QTAW 


prlL16 


1.2 


1.933£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PSD95 


1 


0.866 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PSD95 


1 


1.5915 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PSD95 


3 


2.97€ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PSD95 


3 


3.742 


alpha*2B AR 


QDFRRAFRRILARPWTQTAW 


PSD95 


1.2.3 


A 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PSD95 


1,2.3 


A 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PTN-4 


1 


2.ii4e 


aipha-2B AR 


QDFRRAFRRILARPWTQTAW 


PTN.4 


1 


2.1945 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PTPL1 


1 


0.4725 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PTPL1 


1 


1.011 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PTPL1 


2 


0.68e 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PTPL1 


2 


2.9835 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PTPL1 


3 


0.3955 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PTPL1 


3 


1.8235 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PTPL1 


4 


0.5795 


a!pha-2B AR 


QDFRRAFRRILARPWTQTAW 


PTPL1 


4 


2.3485 


atpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PTPL1 


5 


1.79c 


aIpha-2B AR 


QDFRRAFRRILARPWTQTAW 


PTPL1 


5 


0.4€ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Shank 1 


1 


3.155 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Shank 1 


1 


1.54c 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Shank 3 


1 


2.6715 


aipha-2B AR 


QDFRRAFRRILARPWTQTAW 


Shank 3 


1 


3.408 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Shank 3 


1 


1.3155 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


SIP1 




0.985 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


SIP1 




1.271 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


SITAC 18 


1 


0.5955 


aIpha-2B AR 


QDFRRAFRRILARPWTQTAW 


SITAC 18 


1 


1 .087 


aIpha:2B AR 


QDFRRAFRRILARPWTQTAW 


SITAC 18 




1.05€ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


SITAC 18 




2.0765 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Syntrophin 1 alpha 


1 


2.42e 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Syntrophin 1 alpha 


1 


2.6C 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Syntrophin gamma 1 


1 


0.5265 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Syntrophin gamma 1 


1 


1.792 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Syntrophin gamma 2 


1 


0.59c 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


Syntrophin gamma 2 


1 


2.3375 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


TAX2-like protein 


1 


1.31^ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


TAX2-like protein 


1 


1.54^ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


TIAM1 


1 


1.63C 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


TIAM1 




2.46C 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


TIAM2 




0.78e 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


TIAM2 




1.3665 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


TIP1 




A 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


TIP1 




A 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


TIP2 




1.43c 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


TIP2 




i.76e 
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AVC NAME 


SEQUENCE 


GENE NAME 


DOMAIN 


AVERAGE OD 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


VartuI 


1 


2.282£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


VartuI 


1 


1.232 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


VartuI 


2 


0.6885 


aIpha-2B AR 


QDFRRAFRRILARPWTQTAW 


VartuI 


2 


1.187 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


VartuI 


3 


0.633£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


VartuI 


3 


1.513£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


VartuI 


4 


0.49 1£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


VartuI 


4 


0.99£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


VartuI 


1.2 


1 .492 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


VartuI 


1,2 


1.401 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


VartuI 


1.2 


1.912 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


X-1 1 beta 


1 


1.325£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


X-11 beta 


1 


0.754£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


X-11 beta 


2 


0.4925 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


X-11 beta 


2 


0.999£ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


X-11 beta 


1,2 


2.02^ 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


X-11 beta 


1.2 


1.815 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


ZO-1 


1 


1.7365 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


ZO-1 


1 


0.711 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


ZO-1 


2 


0.7205 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


ZO-1 


2 


1.2305 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


ZO-1 


3 


0.97 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


ZO-1 


3 


0.681 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


ZO-2 


1 


1.53e 


a!pha-2B AR 


QDFRRAFRRILARPWTQTAW 


ZO-2 


1 


0.74^ 


aIpha-2B AR 


QDFRRAFRRILARPWTQTAW 


ZO-2 


2 


1.30€ 


a1pha-2B AR 


QDFRRAFRRILARPWTQTAW 


ZO-2 


2 


0.80E 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


ZO-2 


3 


0.641 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


ZO-2 


3 


1.12 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


ZO-3 


1 


1.7115 


aIpha-2B AR 


QDFRRAFRRILARPWTQTAW 


ZO-3 


1 


3.35E 


alpna-2B AR 


QUrKKArKKILAKrW 1 U 1 AW 


/L\J-o 


1 




alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


ZO-3 


1 


c 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


ZO-3 


2 


3.742 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


ZO-3 


2 


C 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


ZO-3 


3 


3.4125 


alpha-2B AR 


QDFRRAFRRILARPWTQTAW 


ZO-3 


3 


C 



Table 8C: 



AVC NAME 


SEQUENCE 


GENE NAME 


DOMAIN 


AVERAGE OD 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


AF6 




1.94c 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


AF6 




1 .7465 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


AlPC 




1.6195 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


AlPC 




2.45^ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


AlPC 




3.4005 
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AVC NAME 


SEQUENCE 


GENE NAME 


DOMAIN 


AVERAGE CD 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


AlPC 


1 


2.5865 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


AlPC 


3 


1.71-1 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


AlPC 


3 


1.376£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


AlPC 


4 


1.839£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


AlPC 


4 


1 .664£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ALP 


1 


3.092 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ALP 


1 


1 .876£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


APXL1 


1 


2.002 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


APXL1 


1 


3.406£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


CARD14 


1 


A 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


CARD14 


1 


4,072£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


CASK 


1 


2.11c 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


CASK 


1 


1.8105 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


CNK1 


1 


1.862 


a!pha-2C AR 


DFRPSFKHILFRRARRGFRQ 


CNK1 


1 


2.76C 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Cytohesin binding Protein 


1 


2.343 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Cytohesin binding Protein 


1 


2.031 £ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


DLG1 


1 


1.091£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


DLG1 


1 


1.677 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


DLG1 


2 


1 .800£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


DLG1 


2 


1 .289£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


DLG1 


3 


1.949£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


DLG1 


3 


3.024 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


DLG1 


1.2 


2.01c 


a!pha-2C AR 


DFRPSFKHILFRRARRGFRQ 


DLG1 


1.2 


2.253£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


DLG2 


1 


1 .462 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


DLG2 


1 


1.767£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


DLG2 


2 


1.196 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


DLG2 


2 


1.643£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


DLG5 


1 


2.230£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


DLG5 


1 


1.772£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


DLG5 


2 


2.643£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


DLG5 


2 


2.722 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


DLG5 


2 


2.638£ 


■alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


DLG5 


2 


2.034£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


DVL2 


1 


2.33c 


a!pha-2C AR 


DFRPSFKHILFRRARRGFRQ 


DVL2 


1 


3.34£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


DVL3 


1 


3.16£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


DVL3 


.1 


3.479£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


EBP50 


1 


1.364 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


EBP50 




1.977£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


EBP60 


2 


1.49E 


alpha-2G AR 


DFRPSFKHILFRRARRGFRQ 


EBP50 


2 


1.894 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


EBP50 


1.2 


1.351 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


EBP50 


1.2 


1.725£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ENIGMA 


1 


1.375£ 
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AVC NAME 


SEQUENCE 


GENE NAME 


DOMAIN 


AVERAGE OD 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ENIGMA 




2.021£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ERBIN 




1.515£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ERBIN 




1.606£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


FLJ0001 1 




1.499£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


FLJ00011 




1.8335 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


FLJ11215 




1.292 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


FLJ11215 




1.173£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


FLJ12615 




1.356£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


FLJ12615 




1.1595 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


FLJ21687 




1 .8625 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


FLJ21687 




1.42€ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


GRIP 1 


3 


1.6445 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


GRIP 1 


3 


1.331 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


GRIP 1 


4 


3.5815 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


GRIP 1 


4 


3.0575 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


GRIP 1 


5 


2.0285 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


GRIP 1 


5 


1.5895 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


GRIP 1 


5 


C 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


GRIP 1 


5 


1.222 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


GRIP 1 


6 


1.628 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


GRIP 1 


6 


1.3525 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


GRIP 1 


7 


.1.77 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


GRIP 1 


7 


1.581 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


HEMBA 1003117 




1.522 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


HEMBA 1003117 




1.8805 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


HEMBA 1003117 




2.0185 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


HEMBA 1003117 




1.7865 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


INADL 




1.6715 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


INADL 




2.1475 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


INADL 


2 


1.82e 


aipha-2C AR 


DFRPSFKHILFRRARRGFRQ 


INADL - 


2 


2.7205 


a!pha-2C AR 


DFRPSFKHILFRRARRGFRQ 


INADL 


3 


2.00S 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


INADL 


3 


2.43€ 


a!pha-2C AR 


DFRPSFKHILFRRARRGFRQ 


INADL 


4 


2.9215 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


INADL 


4 


3.7865 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


INADL 


5 


1 .7905 


alpha"2C AR 


DFRPSFKHILFRRARRGFRQ 


INADL 


5 


3.2295 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


INADL 


7 


1.4955 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


INADL 


7 


2.885 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


INADL 


8 


3.8525 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


INADL 


8 


o one c 

2.6055 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA0316 




1.9455 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA0316 




1.6115 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA0340 




1.365 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA0340 




2.137 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA0380 




2.455 
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AVC NAME 


SEQUENCE 


GENE NAME 


DOMAIN 


AVERAGE CD 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA0380 




2.437£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA0382 




1 .640£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA0382 




1.828£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA0440 




3.206£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA0440 




2.575£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA0559 




1.641 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA0559 




3.050£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA0751 




2.222£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA0751 




1.890£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA0858 




1.75€ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA0858 




2.30e 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA0967 




1.672 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA0967 




1.677 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1095 




2.102 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1095 




2.791 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1222 




1.72£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1222 




1.89€ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1284 




1.031£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1284 




1.546 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1415 




1.252 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1415 




1.41 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1526 




1 .3335 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1526 




1.6£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1526 




1.847 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1526 




3.853£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1526 


2 


2.1255 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1526 


2 


1 .9005 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1526 


2 


1.30S 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1526 


2 


1.671 


aIpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1620 


1 


1 .037e 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1620 


1 


1 .698£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1719 


1 


1.90£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1719 


1 


1 .875£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1719 


2 


1.541 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1719 


2 


1.214 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1719 


3 


4 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1719 


3 


4.096 


alpha-2C' AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1719 


5 


1.841 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1719 


5 


1 .4085 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


KIAA1719 


6 


2.097£ 


aipna-^L/ Mrs 


nPRPQPk'MII FRRARRriFRn 
Ui r\r Or i\nii_r r\r\Mr\r\orr\v*i 


KIAA171Q 


6 


1.874£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


LIM mystique 




1.842£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


LIM mystique 




1.317 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


LIM protein 




1 .720£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


LIM protein 




2.719£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


LIM-RIL 




1.87 
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AVC NAME 


SEQUENCE 


GENE NAME 


DOMAIN 


AVERAGE CD 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


LIM-RIL 


1 


3.0615 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


LIMK1 


1 


1.67c 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


LIMK1 


1 


2.5345 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


LIMK2 


1 


1.905 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


LIMK2 


1 


2.895 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


LU-1 


1 


3.88C 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


LU-1 


1 


3.1685 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAGI 1 


1 


2.852 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAGI 1 


1 


1.86€ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAGI 1 


1 


3.3655 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAGI 1 


1 


2.637 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAGI 1 


3 


2.2145 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAGI 1 


3 


2.8475 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAGI 1 


3 


2.165 


aIpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAGI 1 


3 


3.515 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAGI 1 


4 


1.997 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAGI 1 


4 


2.597 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAGI 1 


5 


1.85 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAGI 1 


5 


2.221 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Magi 2 


1 


1.727 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Magi 2 


1 


1.9255 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Magi 2 


2 


1.772 


aIpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Magi 2 


2 


1.1935 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Magi 2 


3 


1 .6635 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Magi 2 


3 


1.33€ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Magi 2 


4 


1.62^ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Magi 2 


4 


1.33c 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Magi 2 


5 


1.927 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Magi 2 


5 


2.0965 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Magi 2 


6 


1.701 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Magi 2 


6 


2.0215 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAGI 3 


1 


1.99^ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAGI 3 


1 


2.8775 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAGI 3 


2 


1.987 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAGI 3 


2 


3.067 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAGI 3 


3 


1.7405 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAGI 3 


3 


2.8285 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAGI 3 


4 


2.5175 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAGI 3 


4 


1.6^ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAGI 3 


5 


2.86£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAGI 3 


6 


2.0255 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAST1 




2.06 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAST1 




3.687 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAST2 




1.953 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAST2 




3.8615 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAST4 




2.492 
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AVC NAME 


SEQUENCE 


GENE NAME 


DOMAIN 


AVERAGE OD 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MAST4 


1 


2.457£ 


aIpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MINT1 


1 


1.31 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MINT1 


1 


2.332 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MINT1 


2 


3.812£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MINT1 


2 


2.247£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MINT1 


1.2 


3.001 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MINT1 


1,2 


3.689£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MPP1 


1 


2.130e 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MPP1 


1 


2.432 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MPP2 


1 


2.10^ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MPP2 


1 


2.95€ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


1 


1.981 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


1 


1 .200e 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


2 


1 .8855 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


2 


1.4785 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


3 


1.075£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


3 


2.03C 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


4 


3.332 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


4 


2.1975 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


4 


o.3e 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


5 


1 .8985 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


5 


1.8305 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


6 


2.2€ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


6 


2.3995 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


. MUPP1 


7 


1.871 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


7 


1.8225 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


8 


1.5675 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


8 


1.775 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


9 


1.80^ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


9 


1 .8075 


aipha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


10 


1 .8495 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


10 


1.9885 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


11 


1.45e 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


11 


1.855 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


12 


1.235 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


12 


1.7415 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


13 


1.377 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


13 


2.7645 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


NeDLG 


1 


1 .0965 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


NeDLG 


1 


1.81C 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


NeDLG 


2 


1.2775 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


NeDLG 


2 


1.8C 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


NeDLG 


3 


1.2205 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


NeDLG 


3 


2.2405 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


NeDLG 


1.2 


1.8635 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


NeDLG 


1.2 


1.962 
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AVC NAME 


SEQUENCE 


GENE NAME 


DOMAIN 


AVERAGE OD 


aipha-2C AR 


DFRPSFKHILFRRARRGFRQ 


NOS1 


1 


3.020£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


NOS1 


1 


2.094£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


novel PDZ gene 


1 


2.349£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


novel PDZ gene 


1 


3.23£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


novel PDZ gene 


2 


2.315£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


novel PDZ gene 


2 


2.933£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


outer membrane 


1 


1 .2725 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


outer membrane 


1 


2.04c 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


p55T 


1 


1.3015 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


p55T 


1 


1.561 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PAR3 


3 


1.118£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PARS 


3 


1.4575 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PAR6 


1 


2.567 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PAR6 


1 


2.997 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PAR6 GAMMA 


1 


1.345 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PAR6 GAMMA 


1 


1 .0305 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PDZ-73 


2 


1.965 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PDZ-73 


2 


3.1455 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PDZK1 


1 


1 .4275 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PDZK1 


1 


1 .9785 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PDZK1 


2 


1.04C 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PDZK1 


2 


1.87e 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PDZK1 


3 


1.4145 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PDZK1 


3 


1.9415 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PDZK1 


4 


1.6115 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PDZK1 


4 


1.91G 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PDZK1 


2,3.4 


1.8195 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PDZK1 


2.3.4 


1.C 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PICK1 


1 


2.2435 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PICK1 


1 


3.09^ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PIST 


1 


1.005 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PIST 


1 


1 .2995 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


prlL16 


1 


1.413 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


prlL16 


1 


1 .0525 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


prlL16 


2 


1.30e 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


prlL16 


2 


1.0315 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


prlL16 


1.2 


1.6965 


a!pha-2C AR 


DFRPSFKHILFRRARRGFRQ 


prlL16 


1,2 


2.653 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PSD95 


1 


1.2595 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PSD95 


1 


2.0535 


afpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PSD95 


3 


1.191 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PSD95 


3 


1.71€ 


aIpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PSD95 


1,2.3 


2.4695 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PSD95 


1.2.3 


3.968 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PTN-4 


1 


1.87c 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PTN-4 


1 


2.8045 
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AVC NAME 


SEQUENCE 


GENE NAME 


DOMAIN 


AVERAGE OD 


alpha*2C AR 


DFRPSFKHILFRRARRGFRQ 


PTPL1 


1 


0.813£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PTPL1 


1 


1.111£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PTPL1 


2 


1.37€ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PTPL1 


2 


2.24£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PTPL1 


3 


0.594£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PTPL1 


3 


2.1675 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PTPL1 


4 


1 .046£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PTPL1 


4 


1.851 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PTPL1 


5 


3.292 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


PTPL1 


5 


2.056E 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Shank 1 


1 


1.C 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Shank 1 


1 


i.65e 


a!pha-2C AR 


DFRPSFKHILFRRARRGFRQ 


SIP1 


2 


1.584e 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


SIP1 


2 


2.137 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


SITAC 18 


1 


1.509£ 


aipha-2C AR 


DFRPSFKHILFRRARRGFRQ 


SITAC 18 


1 


4.086 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


SITAC 18 


2 


2.061 £ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


SITAC 18 


2 


A 


aIpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


SYNTENIN 


1 


1 .3545 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


SYNTENIN 


1 


2.247£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


SYNTENIN 


2 


1.36 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


SYNTENIN 


2 


2.5975 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Syntrophin 1 alpha 


1 


2.5625 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Syntrophin 1 alpha 


1 


3.51c 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Syntrophin gamma 1 


1 


1 .5995 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Syntrophin gamma 1 


1 


1.551 


atpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Syntrophin gamma 2 


1 


1.021 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Syntrophin gamma 2 


1 


1.1855 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


TAX2-like protein 


1 


2.065 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


TAX2-like protein 


1 


3.26E 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


TIAM1 


1 


2.51C 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


TIAM1 


1 


3.0965 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


TIAM2 


1 


1 .5895 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


TIAM2 


1 


1.90€ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


TIP1 


1 


2.941 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


TIP1 


1 


2.6375 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


TIP2 


1 


2.421 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


TIP2 


1 


2.8285 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


VartuI 


1 


1 .0885 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


VartuI 


1 


1.45^ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


VartuI 


2 


1.662 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


VartuI 


2 


1 .0935 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


VartuI 


3 


1.17^ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


VartuI 


3 


2.1 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


VartuI 


4 


1.2^ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


VartuI 


4 


1 .8935 
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AVC NAME 


SEQUENCE 


GENE NAME 


DOMAIN 


AVERAGE OD 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


Vartul 


1.2 


1.69£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


VartuI 


1,2 


1 .6875 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


X-11 beta 


1 


2.585£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


X-11 beta 


1 


2.12£ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


X-11 beta 


2 


1.417f 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


X-11 beta 


2 


1.31 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


X-11 beta 


1.2 


1.1835 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


X-11 beta 


1.2 


1.7135 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ZO-1 


1 


1.9365 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ZO-1 


1 


1.715 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ZO-1 


2 


2.4245 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ZO-1 


2 


2.612 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ZO-1 


3 


1.788 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ZO-1 


3 


1.12€ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ZO-2 


1 


1.2275 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ZO-2 


1 


1.4125 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ZO-2 


2 


2.141 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ZO-2 


2 


1.8675 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ZO-2 


3 


2.107 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ZO-2 


3 


1.29^ 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ZO-3 


1 


1.637 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ZO-3 


1 


2.612 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ZO-3 


2 


1.418 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ZO-3 


2 


2.37e 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ZO-3 


3 


1.2515 


alpha-2C AR 


DFRPSFKHILFRRARRGFRQ 


ZO-3 


3 


1 .6585 



Table 8 legend: Tables 8A, SB and 8C show the results of G assay testing (described supra) 



between the three alpha 2 adrenergic subunits and a subset of PDZ domains. All tests are 
performed at lOuM concentration of peptide, and the peptide sequence is displayed in column 
2. The background binding is somewhat high for these peptides (average OD), and a reduced 
number of interactions would be seen with lower peptide concentrations. Duplicate rows of 
PDZ GENE NAME and DOMAIN indicate independent sets of duplicates. A '0' in the 
average OD column indicates failure of the test. 



Table 9: Disorders/biological processes demonstrated to be affected to alpha adrenergic 
modulation 



Receptor 



Disorder/process 



132 



Receptor 


Disorder/process 


Al 


Depression 


Al 


Lower Urinary Tract Storage 


Al 


Migraine 


Al 


Prostate aooDtosis 


Al 


Hvoertroohv. oroliferation and migration of 
vascular smooth muscle after carotid injury 


A2 


Migraine 


A2 


Coronary Flow Reserve following stenting 


A2 


Alzheimer's 


A2 


Parkinson's 


A2 


Neuroprotection 


A2 


Glaucoma 


A2 


Opioid withdrawl 



Conclusions and Summary 

Table 8A,8B and 8C are the first demonstrations that we've discovered of alpha 2 
adrenergicreceptor(A2R) interactions with PDZ domains. Alpha 1 adrenergicreceptors(AlR) 
also contain PL sequences at their C-termini, but different that AlRs, implying binding to a 
different subset of PDZ domains. A single report demonstrated an interaction between alpha 
1 receptor A and NOSl (a PDZ protein; Pupo et al. (2002) BMC Pharmacology2:17), but the 
authors demonstrated that this interaction was not dependent on the PL of the AlA adrenergic 
receptor. Without intending to be limiting, blocking interactions between apha adrenergic 
receptors and PDZ domains can modulate the effect of signaling through these receptors and 
provide a new set of therapeutic targets for treatment of diseases or disease stemming from 
misfunctioning biological processes such as those listed in Table 9. 
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*** 

The present invention is not to be limited in scope by the exemplified 
embodiments which are intended as illustrations of single aspects of the invention and any 
sequences which are functionally equivalent are within the scope of the invention. Indeed, 
5 various modifications of the invention in addition to those shown and described herein will 
become apparent to those skilled in the art from the foregoing description and accompanying 
drawings. Such modifications are intended to fall within the scope of the appended claims. 

All publications cited herein are incorporated by reference in their entirety and 

for all purposes. 
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Table 2. 



Dl Awr* 
rL AVL/ 

ID 


Dl 
rL 


rL ^uivier oequence 




Domai 
n 


Bindin 

g 

Streng 
th 


AA250 


«jn 1 <jr\ ^oci uLui III 1 1 cCrC^iui o 

hydroxytryptamlne 3A) 


LAVLAYSITLVMLWSIWOYA 


HEMBA 1003117 


i 
1 


o 


AA250 


hydroxy tryptamine 3 A) 


LA VLAYSITLVM LWSI WOYA 
V 1 oi 1 ^ vivi^w wi vvvai i n 


CARD14 


1 


2 


AA250 


'\l-IT')A /corntonin roopntnr ^- 

3ll 1 O/A ^oCI V^LWll III 1 1 COCfJtUI \J 

hydroxytryptamine 3A) 


LAVLAYSITLVMLWSIWOYA 

LAA V l_n I Ol 1 ^ V IVI^WOI VVVjf 1 r\ 


MPP2 


-j 


2 


AA233L 


'iHTPB ^<5Protnnln rpreotor 5- 

hydroxytryptamine 2B) 


DTLLLTEN EG DKTEEOVSY V 


MAGI 1 


3 


4 




on 1 £D ^ocl >JIUI III 1 I KS\^tS\J\\Jf iJ 

hydroxytryptamine 28) 


DTLLLTEN EG DKTEEOVSY V 


HEMRA 1003117 

n ^IVI DrA 1 x/wO 1 1 r 


-| 


p 


t. O O I— 


\^III^D ^odL^lLIIIIII 1 KSv^tSfJvK/t \J 

hydroxytryptamine 2B) 


DTLLLTEN EG DKTEEO VS YV 


HEMBA 1003117 


1 


■\ 


A Aoqoi 


f^MXPR ^GProtnnin rpr>pnfnr 

hydroxytryptamine 2B) 


DTI LLTENEGDKTEEOVSYV 


KIAA0316 


■| 


1 




hydroxytryptamine 28) 


DTI 1 1 TFMFGnKTEFOVSYV 


Ki AAnfln7 

rxl rA/AUOw / 


1 


1 




9n 1 £D ^9ci<Ji\ji III 1 txsKfXS\ji\ji %j 

hydroxytryptamine 28) 


DTI 1 1 TENEGDKTFEOVSYV 


KIAA1R34 

ixl/A/A 1 xJOt 


2 


5 




9il 1 ^OCI (JlLli III 1 lCLrC|JlUI »J 

hydroxytryptamine 28) 


DTI 1 1 TFNFGDKTFFOVSYV 


KIAA0fl07 

fx 1 rA/AxiOU / 




i 
1 


A AOQQI 
/A/A^ O 0 1— 


'^l-IXPR /corntonin ronpntrtr 

1 ^Ot^l UllJI III 1 1 KS\^G\J\.\Jl %J 

hydroxytryptamine 28) 


DTLLLTEN EG DKTEEOVSY V 


Mint 1 


2 


1 


AAP33L 


'^MTPR ^Qprntnnin rpr'Pntnr '^•> 

^1 1 1 ^OCI Ulwl III 1 1 C\*^fJA\Jt 

hydroxytryptamine 28) 


DTLLLTEN EG DKTE EOVSY V 


MINT1 


1 ,2 


-j 




'^HTPR /cprntonin rpr'Pntnr ^- 

\Jt 1 1 ^Od UlVi/l ill 1 t K^K^X^lJUJl O 

hydroxytryptamine 28) 


DTLLLTEN EG DKTF EOVSY V 


PTPL-1 


2 


5 




ori 1 £D ^ociuiuiiiii ici«c|Jiui 9~ 

hydroxytryptamine 28) 


DTI 1 1 TFNFGDKTFFOVSYV 


PTPI -1 


4 


p 

c 


AA243 


aPA-AR ^mrtrlifipH^ /nrirpnornir* 

receptor alpha-2A) 


HDFRRAFKKILARGDRKRIV 
nLyi nn/Ai r\r\iL^AnvjiL>nr\rii v 


Magi2 


5 


1 


AA243 


receptor alpha-2A) 


HDFRRAFKKILARGDRKRIV 


MAG1 1 


g 


3 


AA243 


aPA-AR ^mnriifipH^ /AHrpnprnin 

receptor alpha-2A) 


HDFRRAFKKILARGDRKRIV 


CARD14 


■j 


1 


AA243 


nPA-AR ^moHifipH^ ^aHrpnornir* 

d^/A /An ^1 1 vJI 1 ICU^ ^ClUl Cl 1 c;l ^ 11^ 

receptor alpha-2A) 


HDFRRAFKKII ARGDRKRIV 
ni^i nii/Ai rxTxiL^AnvjiLynrxni v 


HFIWIRA 1003117 

n ^IVI DrA 1 \J\Jyj 11/ 




1 


AA243 


^PA-AR /mnHiflpH^ ^arlrpnornir' 

receptor alpha-2A) 


HnFRRAFKKII ARGDRKRIV 


Fl IP1RR7 


1 


-j 




nPA-AR ^rrtnHifioH\ /aHronornif^ 
atr\-rAn ^lilUUlMc^U^ \clUici Ici yiL* 

receptor alpha-2A) 


HDFRRAFKKII ARGDRKRIV 
nu/i nn/Ai 1X1x1 L^AnvjiL/n ixni v 


APXl 1 

/Ai /Xl_ 1 


1 


1 




cit/A~/A n ^iiiLiuiiicu/ ^auiciioryio 

receptor aipha-2A) 


HDFRRAFKKII ARGDRKRIV 
ni^nnn/Ai rxrxiL^AnvjU'nrxni v 


HFMRA 1003117 


-\ 


\ 


AA243 


a2A-AR (modified) (adrenergic 
receptor alpha-2A) 


HDFRRAFKKILARGDRKRIV 


INADL 


3 


2 


AA243 


a2A-AR (modified) (adrenergic 
receptor alpha-2A) 


HDFRRAFKKILARGDRKRIV 


INADL 


4 


1 


AA243 


a2A-AR (modified) (adrenergic 
receptor alpha-2A) 


HDFRRAFKKILARGDRKRIV 


KIAA0340 


1 


1 


AA243 


a2A-AR (modified) (adrenergic 
receptor alpha-2A) 


HDFRRAFKKILARGDRKRIV 


KIAA0751 


1 


3 


AA243 


a2A-AR (modified) (adrenergic 
receptor alpha-2A) 


HDFRRAFKKILARGDRKRIV 


KIAA0807 


1 


1 
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AA243 


CIC/A /Aft ^1 1 ILfUII ICVjy ^dUi CMCfl 

receptor alpha-2A) 


HnFRRAPKkll ARt^nRkRIV/ 
nL/i nn/Ai ixixiL^AnvjiL/nrvni v 


KIAAIPftA 

ixIrArA 1 tO*T 


1 


1 
1 


AA243 


CI^AA mi ^1 1 IVi/UII ICU/ \Clvll CI 

receptor alpha-2A) 


HnFRRAFKKtl ARf^DRKRiV 
nL/i nn/Ai rxixiL^AnvjiL/nrxni v 


KIAA1 f^Pfi 


-| 


4 


AA243 


a2A*An ^modifipH^ ^artrpnprnip 
receptor alpha-2A) 


HDFRRAFKKILARGnRKRIV 


K1719 


4 


Q 
O 


AA243 


r32A-AR ^modifipH^ fadrpnprnir 
receptor alpha-2A) 


HDFRRAFKKILARGDRKRIV 


LIM-Mv^tifiijp 

^iivi ivi jr 9«iv|vic 


1 


1 


AA243 


a2A-AR ^mndifipfi) fadrpnprnip 

receptor alpha-2A) 


HDFRRAFKKILARGDRKRIV 


IVIint 1 


1 




AA243 


f^PA-AR ^mndifipH^ /nHrpnprnip 

receptor alpha-2A) 


HnFRRAFKKILARGDRKRIV 

1 ILyl rilirAI IXJxIL^AI l\^l»/f ifxl il V 


j\/|ypp-| 




2 


AA243 


ci^/A rMi ^iiiuuiiidJ^ ^duidid^io 

receptor alpha-2A) 


HDFRRAFKKILARGDRKRIV 
fii^i nn/Ai rxrxiL^AnvsL^nixni v 


MIIPPI 


o 
o 


1 


AA243 


receptor atpha-2A) 


HDFRRAFKKII ARGDRKRIV 
niJi nii/Ai rNJviL/AnvjL/nrxni v 


Ml Ippi 


1 o 


i 


AA243 


receptor alpha-2A) 


HDFRRAFKKII ARGDRKRIV 
ni_/i nn/Ai rxr\iL.rAnvjiLi/nr\ni v 


PAR'3| 
1 rAno 


o 
o 


o 
c 


AA243 


d£/A /Ari ^flldUlilCU^ ^dUfdld^tO 

receptor alpha-2A) 


HDFRRAFKKII ARGDRKRIV 
nL^i nn/ArrxrxiL^AnvjL^nrxni v 


PTPL-1 




o 


AA243 


AA AA n 1 Iv/UII idJ/ ^dVJIdlCi^iU 

receptor alpha-2A) 


HDFRRAFKKII ARGDRKRIV 
nLVi nn/Ai r vrxiLVAnvjaL/nrxnj v 


Ol 1 AAO 1 O 


-] 


4 


AA243 


flPA-AR ^mnHifipH^ /siHr^n^arrjIn 

receptor alpha-2A) 


HDFRRAFKKII ARGDRKRIV 

1 iL/i mvAAi 1 XI \ii_rAnv>ii_/ni xni V 


Q|TAn-1R 

0 1 1 /Av^ 1 O 


2 


4 


AA243 


nPA-AR ^modlfipH\ ^Adrpnpmir* 

d^rA /An ^IIIUVJMICVJ/ ^dUIBIId^lt# 

receptor alpha-2A) 


HDFRRAFKKILARGDRKRIV 
riL^r nn/Ai i\rxiL.rAnvJii-/nrxni v 


KIAA1 SPR 

iXl/A/A 1 \J^\J 


2 


i 




aPA-AR {mrtci\ii^ri\ /siHronorni/^ 
a&.rA rAri ^lliuu)llcvj^ ^aiJldidyii« 

receptor alpha-2A) 


HDFRRAFKKII ARGDRKRIV 
niJi nn/Ai ixJxii_/Anvjii»/nrxni v 


/\ 1 1 Udd 


1 


Q 


AA243 


d^/A /An ^11 iLiui) icsvj/ ^dvJi t$i iBi y ii« 

receptor atpha-2A) 


HDFRRAFKKII ARGDRKRIV 
ni^i nn/Ai ixTxi i_/AnvjL/nrxni v 


yi 1-hpta 


2 


1 


AA243 


ci£_/A /A n ^iiiuuiiicvj^ ^dui ci let y iLf 

receptor alpha-2A) 


HDFRRAFKKII ARGDRKRIV 
ni— /I nn/Ai ixixiL/An vjL/nr\ni v 


70-1 


o 

cL 




AA243 


flPA-AR ^mnHifipH^ ^?iHrpnprnir' 

receptor alpha-2A) 


HnPRRAFKKII ARGHRKRIV 

1 nnAAi ixf \iL-rAn\ji_/nixni v 


ZO-2 


2 


2 


AA243 


nPA-AR ^moHifioH^ ^fiHronornio 

receptor alpha~2A) 


HDFRRAFKKII ARGDRKRIV 
nL/i nn/Ai rxrxii.-/AnvjL/nrxni v 




p 

c. 


o 


AA243 


aPA-AR fmoHifiprl\ /siHronornir^ 
d^^'rAri ^iiiuuiiiBvi/ ^dUi ei ici y iw> 

receptor alpha-2A) 


HDFRRAFKKII ARGDRKRIV 
nL^i nn/Ai rxixiL/AnvjiL/nrxni v 


Di GR 


p 


1 




d^/A /An ^IllUUiilcvl/ ^dvjl d lc?iy lO 

receptor afpha-2A) 


HDFRRAFKKII ARGDRKRIV 
riL<*i nn/Ai rxixiL/AnvjL^nrxni v 


AiPP 

Mil O 


1 

1 


p 


AA243 


CL^r\~r\ry ^iiiuuiiicuy \dVJiBiidyic> 

receptor a(pha-2A) 


HDFRRAFKKII ARGDRKRIV 
nijr nn/Ai rxr\ii_/AnvjiLjnr\ni v 


oyi III Ufji III 1 ydfiiiiid 
1 


1 


p 


AA243 


d^AA /An ^niuuitiBU/ ^dui oi ici y i\« 

receptor alpha-2A) 


HDFRRAFKKII ARGDRKRIV 
n Ui n n/Ai ixixi L/Ano L/nrvni v 


ManiP 
iviciyi^ 


c 
o 


1 

1 


AA244 


aPR-AR /mnHifiod\ (stfiretnoirrtlf* 
D /An ^iiiuuiiicu/ ^dui CI iBi y ii# 

receptor alpha-2B) 


ODFRRAFRRII ARPWrOTAW 


1 OL^57i^ 


1 p T 




AA244 


aPR-AR /mnrlifiAH^ ^nHronprnip 
a^D nri ^iiiuuiiiou^ ^duidiciyii^ 

receptor alpha-2B) 


ODFRRAFRRII ARPWTOTAW 
v<<L./i nn/Ai nnii—TAni v v i xjc i rAvv 


TIP1 


1 


5 


AA244 


d^D /An ^fiiuuiimuy ^dvJi ci ici y 

receptor alpha-2B) 


ODFRRAFRRII ARPWTOTAW 

xjII^i nriMrnrilL^nr VV 1 Vrf 1 MVV 


KiAAnnny 


i 
1 


f 


AA244 


a2B-AR (modified) (adrenergic 
receptor alpha-2B) 


QDFRRAFRRI LARPWTQTAW 


KIAA0303 


1 


4 


AA244 


a2B-AR (modified) (adrenergic 
receptor alpha-2B) 


QDFRRAFRRILARPWTQTAW 


MAGI1 


2 


4 


AA244 


a2B-AR (modified) (adrenergic 
receptor alpha-2B) 


QDFRRAFRRILARPWTQTAW 


MAGI1 


4 


5 


AA245 


a2C-AR (modified) (adrenergic 
receptor alpha-2C) 


DFRPSFKHILFRRARRGFRQ 


GRIP1 


5 


1 
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AA245 


a2C-AR (modified) (adrenergic 
receptor alpha-2C) 


DFRPSFKHILFRRARRGFRQ 


LU1 


1 


4 


AA245 


a2C-AR (modified) (adrenergic 
receptor alpha-2C) 


DFRPSFKHILFRRARRGFRQ 


PTPL-1 


5 


3 


AA245 


a2C-AR (modified) (adrenergic 
receptor alpha-2C) 


DFRPSFKHILFRRARRGFRQ 


APXL1 


1 


3 


AA245 


a2C-AR (modified) (adrenergic 
receptor alpha-2C) 


DFRPSFKHILFRRARRGFRQ 


KIAA1719 


3 


5 . 


AA245 


a2C'AR (modified) (adrenergic 
receptor alpha-2C) 


DFRPSFKHILFRRARRGFRQ 


Mint 1 


2 


3 


AA245 


a2C-AR (modified) (adrenergic 
receptor alpha-2C) 


DFRPSFKHILFRRARRGFRQ 


MUPP1 


4 


3 


AA245 


a2C-AR (modified) (adrenergic 
receptor alpha-2C) 


DFRPSFKHILFRRARRGFRQ 


KIAA0973 


1 


3 


AA245 


a2C-AR (modified) (adrenergic 
receptor alpha-2C) 


DFRPSFKHILFRRARRGFRQ 


CARD14 


1 


5 


AA245 


a2C-AR (modified) (adrenergic 
receptor alpha-2C) 


DFRPSFKHILFRRARRGFRQ 


DVL2 


1 


3 


AA252 


ACM3 (muscarinic acetylcholine 

receptor M3) 


QQYQQRQSVIFHKRAPEQAL 


APXL1 


1 




AA252 


ACM3 (muscarinic acetylcholine 
receptor M3) 


QQYQQRQSVIFHKRAPEQAL 


KIAA0807 


1 


^ 




AA252 


ACM3 (muscarinic acetylcholine 
receptor M3) 


QQYQQRQSVIFHKRAPEQAL 


KIAA0807 


1 




AA252 


ACM3 (muscarinic acetylcholine 
receptor M3) 


QQYQQRQSVIFHKRAPEQAL 


AlPC 


1 


^ 

• 


AA181 


BAI-1 (brain-specific angiogenesis 
inhibitor 1) 


RSGATIPLVGQDIIDLQTEV 


TIP1 


1 





AA181 


BAI-1 (brain-specific angiogenesis 
inhibitor 1) 


RSGATIPLVGQDIIDLQTEV 


KIAA1526 


1 




AA181 


BAI-1 (brain-specific angiogenesis 
inhibitor 1) 


RSGATIPLVGQDIIDLQTEV 


PSD95 


2 


^ 


AA1 81 


BAI-1 (brain-specific angiogenesis 
inhibitor 1) 


RSGATIPLVGQDIIDLQTEV 


TIP 43 


1 


^ 


AA181 


BAI-1 (brain-specific angiogenesis 
inhibitor 1 ) 


RSGATIPLVGQDIIDLQTEV 


NeDLG 


1,2 





AA1 81 


BAI-1 (brain-specific angiogenesis 
inhibitor 1) 


RSGATIPLVGQDIIDLQtEV 


KIAA0973 


1 




AA181 


BAI-1 (brain-specific angiogenesis 
inhibitor 1) 


RSGATIPLVGQDIIDLQTEV 


INADL 


3 


^ 


AA181 


BAI-1 (brain-specific angiogenesis 
Inhibitor 1) 


RSGATIPLVGQDIIDLQTEV 


DLG2 


2 


^ 


AA45 


BLR-1 (Burkitt's lymphoma 
receptor 1) 


PSWRRSSLSESENATSLTTF 


KIAA0561 


1 


^ 


AA45 


BLR-1 (Burkitt's lymphoma 
receptor 1 ) 


PSWRRSSLSESENATSLTTF 


PDZK-1 


2 


^ 


AA45 


BLR-1 (Burkitt's lymphoma 
receptor 1) 


PSWRRSSLSESENATSLTTF 


KIAA0807 


1 




AA45 


BLR-1 (Burkitt's lymphoma 
receptor 1) 


PSWRRSSLSESENATSLTTF 


PD2K1 


2,3,4 


^ 


AA45 


BLR-1 (Burkitt's lymphoma 
receptor 1) 


PSWRRSSLSESENATSLTTF 


SHANK 


1 


1 


AA45 


BLR-1 (Burkitt's lymphoma 
receptor 1) 


PSWRRSSLSESENATSLTTF 


KIAA0B07 


1 


2 


AA269 


C5AR (C5a anaphylatoxin 
chemotactic receptor) 


ESKSFTRSTVDTMAQKTQAV 


PTPL-1 


4 


1 


AA29.2 


IL8RB (lnterleukin-8 receptor B) 


KDSRPSFVGSSSGHTSTTL 


KIAA0807 


1 


5 


AA29.2 


IL8RB (lnterleukin-8 receptor B) 


KDSRPSFVGSSSGHTSTTL 


SHANK 


1 


3 
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AA29.2 


IL8RB (lnterleukin-8 receptor B) 


KDSRPSFVGSSSGKTSTTL 


KIAA0382 


1 


2 


AA29.2 


IL8RB (lnterleukin-8 receptor B) 


KDSRPSFVGSSSGHTSTTL 


KIAA0807 


1 


5 


AA215 


CKR5 (CC Chemokine receptor 
type 5) 


ERASSNATTISTGEQEISVGL 


KIAA1719 


5 




AA215 


CKR5 (CC Chemokine receptor 
type 5) 


ERASSVYTRSTGEQEISVGL 


KIAA1719 


2 


^ 


AA215 


CKR5 (CC Chemokine receptor 
type 5) 


ERASSVYTRSTGEQEISVGL 


TAX IP2 


1 


^ 




AA215 


CKR5 (CC Chemokine receptor 
type 5) 


ERASSVYTBSTGEQEISVGL 


TIP1 


1 




AA215 


CKR5 (CC Chemokine receptor 
type 5). 


ERASSVYTRSTGEQEISVGL 


MINT1 


1,2 




AA215 


CKR5 (CC Chemokine receptor 
type 5) 


ERASSVYTRSTGEQEISVGL 


KIAA1634 


1 




AA124 


CXCR3 (C-X-C Chemokine receptor 

type 3) 


SSSRRDSSWSETSEASYSGL 


ELFIN 1 


1 


^ 


AA124 


CXCR3 (C-X-C Chemokine receptor 
tVPe 3) 


SSSRRDSSWSETSEASYSGL 


KIAA0807 


1 


2 


AA124 


CXCR3 (C-X-C Chemokine receptor 

type 3) 


SSSRRDSSWSETSEASYSGL 


KIAA0807 


1 


1 


AA114 


GLUR7 (metabotropic glutamate 
receptor 7) 


VDPNSPAAKKKYVSYNNLVI 


KIAA1634 


1 


1 


AA114 


GLUR7 (metabotropic glutamate 
receptor 7) 


VDPNSPAAKKKYVSYNNLVI 


DLG1 


2 


1 


AA114 


GLUR7 (metabotropic glutamate 
receptor 7) 


VDPNSPAAKKKYVSYNNLVI 


PAR3 


3 


2 


AA29.3 


IL-8RA (lnterleukin-8 receptor A) 


ARHRVTSYTSSSVNVSSNL 


KIAA0807 


1 




AA29.3 


IL-8RA (lnterleukin-8 receptor A) 


ARHRVTSYTSSSVNVSSNL 


KIAA0380 


1 


1 


AA29.3 


IL-8RA (lnterleukin-8 receptor A) 


ARHRVTSYTSSSVNVSSNL 


KIAA1634 


1 


1 


AA29.3 


IL-8RA (lnterleukin-8 receptor A) 


ARHRVTSYTSSSVNVSSNL 


MAGI 1 


2 


1 


AA29.3 


IL-8RA (lnterleukin-8 receptor A) 


ARHRVTSYTSSSVNVSSNL 


PSD95 


1,2,3 




AA29.3 


IL-8RA (lnterleukin-8 receptor A) 


ARHRVTSYTSSSVNVSSNL 


MAG1 1 


6 


1 


AA29.3 


IL-8RA (lnterleukin-8 receptor A) 


ARHRVTSYTSSSVNVSSNL 


Syntrophin 1 alpha 


1 


1 


AA29.3 


IL-8RA (lnterleukin-8 receptor A) 


ARHRVTSYTSSSVNVSSNL 


KIAA1634 


5 


1 


AA29.3 


IL-8RA (lnterleukin-8 receptor A) 


ARHRVTSYTSSSVNVSSNL 


MUPP1 


13 


^ 


AA29.3 


IL-8RA (lnterleukin-8 receptor A) 


ARHRVTSYTSSSVNVSSNL 


novel PDZ gene 


2 


1 


AA29.3 


IL-8RA (lnterleukin-8 receptor A) 


ARHRVTSYTSSSVNVSSNL 


PDZK1 


2,3,4 


1 


AA29.3 


IL-8RA (lnterleukin-8 receptor A) 


ARHRVTSYTSSSVNVSSNL 


TIP1 


1 


1 


AA330 


P2Y1 (P2Y Purinoceptor 1) 


SEDMTLNILPEFKQNGDTSL 


ELFIN 1 


1 


1 


AA330 


P2Y1 (P2Y Purinoceptor 1) 


SEDMTLNILPEFKQNGDTSL 


KIAA0807 


1 




AA330 


P2Y1 (P2Y Purinoceptor 1) 


SEDMTLNILPEFKQNGDTSL 


Magi2 


5 


' 


AA330 


P2Y1 (P2Y Purinoceptor 1) 


SEDMTLNILPEFKQNGDTSL 


KIAA0316 


1 




AA330 


P2Y1 (P2Y Purinoceptor 1) 


SEDMTLNILPEFKQNGDTSL 


EBP50 


1 




AA330 


P2Y1 (P2Y Purinoceptor 1) 


SEDMTLNILPEFKQNGDTSL 


KIAA0807 


1 




AA330 


P2Y1 (P2Y Purinoceptor 1) 


SEDMTLNILPEFKQNGDTSL 


APXL1 


1 


j— 


AA330 


P2Y1 (P2Y Purinoceptor 1) 


SEDMTLNILPEFKQNGDTSL 


PTN-4 


1 




AA330 


P2Y1 (P2Y Purinoceptor 1) 


SEDMTLNILPEFKQNGDTSL 


EBP50 


2 


; 


AA268 


PTR2 (Parathyroid hormone 
receptor) 


RPMESNPDTEGAQGETEDVL 


APXL1 


1 




AA268 


PTR2 (Parathyroid hormone 
receptor) 


RPMESNPDTEGAQGETEDVL 


PAR3 


3 




AA205L 


5HT2C (serotonin receptor 5- 
hydroxytryptamine 2C) 


ENLELPVNPSSWSERISSV 


MUPP1 


10 




AA205L 


5HT2C (serotonin receptor 5- 
hydroxytryptamine 2C) 


ENLELPVNPSSWSERISSV 


INADL 


8 




AA248 


SSR4 (somatostatin receptor type 
4) 


EALQPEPGRKRIPLTRTTTF 


MAG11 


5 
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AA248 


SSR4 (somatostatin receptor type 

V 


EALQPEPGRKRIPLTRTTTF 


MAG1 1 


4 


1 


AA248 


SSR4 (somatostatin receptor type 
4) 


EALQPEPGRKRIPLTRTTTF 


DLG1 


1,2 


1 


AA248 


SSR4 (somatostatin receptor type 
4) 


EALQPEPGRKRIPLTRTTTF 


KIAA0807 


1 


1 


AA248 


SSR4 (somatostatin receptor type 

4) 


bALOHbPGHKHIPL 1 H 1 1 i h 


MINT1 


1,2 


1 


AA248 


SSR4 (somatostatin receptor type 
4) 


bALOPbPGKKKIPLIHI 1 Ih 


PDZK1 


2,3,4 


1 


AA113 


SSTR2 (somatostatin receptor 
type 2) 


LNETTETQRTLLNG DLQTSI 


KIAA0382 


1 


1 


AA113 


SSTR2 (somatostatin receptor 
type 2) 


LNETTETQRTLLNGDLQTSI 


KIAA0807 


1 


2 


AA113 


SSTR2 (somatostatin receptor 
type 2) 


LNETTETQRTLLNGDLQTSI 


KIAA1 526 


1 


1 


AA113 


SSTR2 (somatostatin receptor 
type 2) 


LNETTETQRTLLNGDLQTSI 


KIAA1719 


6 


1 


AA113 


SSTR2 (somatostatin receptor 
type 2) 


LNETTETQRTLLNGDLQTSI 


Mint 1 


2 


1 


AA1 13 


SSTR2 (somatostatin receptor 
type 2) 


LNETTETQRTLLNGDLQTSI 


SHANK 


1 


1 


AA1 13 


SSTR2 (somatostatin receptor 
type 2) 


LNETTETQRTLLNGDLQTSI 


GRIP1 


7 


1 


AA113 


SSTR2 (somatostatin receptor 
type 2) 


LNETTETQRTLLNGDLQTSI 


KIAA0807 


1 


2 


AA1 13 


SSTR2 (somatostatin receptor 
type 2) 


LNETTETQRTLLNGDLQTSI 


MINT1 


1,2 


1 


AA329 


VI PS (vasoactive intestinal 
polypeptide receptor 2) 


LQFHRGSRAQSFLQTETSVI 


SSTRIP 


1 


3 


AA329 


VIPS (vasoactive intestinal 
polypeptide receptor 2) 


LQFHRGSRAQSFLQTETSVI 


MAGI 1 


2 


1 


AA329 


VIPS (vasoactive intestinal 
polypeptide receptor 2) 


LQFHRGSRAQSFLQTETSVI 


MAG1 1 


5 


1 


AA329 


VIPS (vasoactive intestinal 
polypeptide receptor 2) 


LQFHRGSRAQSFLQTETSVI 


EBP50 


1 


1 


AA329 


VIPS (vasoactive intestinal 
polypeptide receptor 2) 


LQFHRGSRAQSFLQTETSVI 


FLJ0001 1 


1 


2 


AA329 


VIPS (vasoactive intestinal 
polypeptide receptor 2) 


LQFHRGSRAQSFLQTETSVI 


INADL 


8 


1 


AA329 


VIPS (vasoactive intestinal 
polypeptide receptor 2) 


LQFHRGSRAQSFLQTETSVI 


KIAA0382 


1 


1 


AA329 


VIPS (vasoactive intestinal 
polypeptide receptor 2) 


LQFHRGSRAQSFLQTETSVI 


KIAA0807 


1 


3 


AA329 


VIPS (vasoactive intestinal 
polypeptide receptor 2) 


LQFHRGSRAQSFLQTETSVI 


KIAA0807 


1 


3 


AA329 


VIPS (vasoactive intestinal 
polypeptide receptor 2) 


LQFHRGSRAQSFLQTETSVI 


INADL 


3 


1 


AA329 


VIPS (vasoactive intestinal 
polypeptide receptor 2) 


LQFHRGSRAQSFLQTETSVI 


KIAA0973 


1 


1 


AA329 


VIPS (vasoactive intestinal 
polypeptide receptor 2) 


LQFHRGSRAQSFLQTETSVI 


KIAA1526 


2 


2 


AA329 


VIPS (vasoactive intestinal 
polypeptide receptor 2) 


LQFHRGSRAQSFLQTETSVI 


KIAA1526 


1 


1 


AA329 


VIPS (vasoactive intestinal 
polypeptide receptor 2) 


LQFHRGSRAQSFLQTETSVI 


NSP 


1 


1 


AA329 


VIPS (vasoactive intestinal 
polypeptide receptor 2) 


LQFHRGSRAQSFLQTETSVI 


PIST 


1 


1 
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AA329 


VI PS (vasoactive intestinal 
polypeptide receptor 2) 


LQFHRGSRAQSFLQTETSVI 


Shank 1 


1 


2 


AA329 


VI PS (vasoactive intestinal 
polypeptide receptor 2) 


LQFHRGSRAQSFLQTETSVI 


Shank 3 


1 


2 


AA329 


ViPS (vasoactive intestinal 
polypeptide receptor 2) 


LQFHRGSRAQSFLQTETSVI 


TIP 43 


1 


1 
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Table 3. 



Gene Name (Synonyms) 


Genbank 
Reference 


Last 20 aa 


Last 4 aa 


PL 

? 


AVCPL 
ID 


Adenosine A1 receptor (AdenoAl R) 


S45235.1 
Gl:256154 


FRCQPAPPIDEDLPEERPDD 


RPDD 






Adenosine A2a receptor (AdenoA2a. ADORA2A) 


X68486.1 
GI:400451 


VCPEPPGLDDPLAQDGAGVS 


AGVS 






Adenosine A2b receptor (AdenoA2b) 


M97759.1 
Gl: 1781 49 


ADVKSGNGQAGVQPALGVGL 


GVGL 


X 




Adenosine A3 receptor (AdenoA3R) 


AAA16365.1 
Gl:349449 


ACWCHPSDSLDTSIEKNSE 


KNSE 






Adrenergic receptor alpha 2B (a2BadrR, alpha- 
2B adrenoceptor, subtype C2) 


M34041.1 
Gl: 1781 97 


QDFRRAFRRILCRPWTQTAW 


QTAW . 


X 


AA244 


Adrenergic receptor alpha-IA (alAAdrR. 
Alpha 1A-adrenoceptor, Alpha-1C adrenergic 
receptor). 


U02569.1 
Gl:409028 


HQVPTIKVHTISLSENGEEV 


GEEV 


X 




Adrenergic receptor Atpha-1 A isofbnTi4 
(a1AAdri4) 


AF01 3261.1 
Gl:2978555 


REHIKHVNFMMPPWRKGLEC 


GLEC 


X 




Adrenergic receptor alpha-1 B (a1 BAdrR, 

la 1 D~elUI C7I lWUB|k/liJI f. 


U03865.1 


DVANGQPGFKSNMPLAPGQF 


PGQF 


X 




Adrenergic receptor atpha-1 C Isoform 2 


D32202.1 


FLVETGFHHVGQDDLDLLTS 


LLTS 






Adrenergic receptor atpha-1 C IsofbrmS 
(alCAdriS) 


D32201.1 
Gl:927210 


ITVSKDQSSCTTARGHTPMT 


TPMT 






Adrenergic receptor alpha-1 D (alDAdrR) 


U03864.1 
ni'4Q4QAn 


GATCQAYELADYSNLRETDI 


ETDI 


X 




Adrenergic receptor alpha-2A (a2AAdrRec, 
alpha-2A adrenoceptor, subtype CIO) 


M23533.1 
Gl:178195 


HDFRRAFKKILCRGDRKRIV 


KRIV 


X 


AA243 


Adrenergic receptor alpha-2C (a2CARC4, 
alpha-2C adrenoceptor, SubtypeC4) 


JOooOo.l 

Gl:178193 


DrRRorKnILrRKKRRCjrKCj 


CjrR(J 


X 


AAZ40 


Adrenergic receptor Beta- la (blAdrRec) 


J03019.1 
Gl:178199 


DSDSSLDEPCRPGFASESKV 


ESKV 


X 




Adrenergic receptor beta-2a (2AdrRec) 


Y00106.1 
Gl:29370 


VPSDNIDSQGRNCSTNDSLL 


DSLL 


X 




Adrenergic receptor Beta-3 (bSAdrRec) 


X72861.1 
Gl:298094 


SSPAQPRLCQRLDGASWGVS 


WGVS 






Adrenocorticotropic hormone receptor (ACTH 
receptor, ACTH-R, Melanocortln-2 receptor, 
MC2-R, Adrenocorticotropin receptor) 


X65633.1 
Gl:28343 


FRSPELRDAFKKMIFCSRYW 


SRYW 






Adrenomeduilin receptor (AM-R, AdrmedR) 


Y1 3583.1 
Gl:2652933 


AHHLLPNTSPISPTQPLTPS 


LTPS 






Angiotensin II receptor type 1 (ATI, Anglltl, 
AT1AR) 


M91464.1 
Gl:179121 


RPSDNVSSSTKKPAPCFEVE 


FEVE 






Angiotensin II receptor type-IB (Angtl IB, AT1B, 
ATI BR) 


D13814.1 
Gl:471120 


RPSDNVSSSTKKPAPCFEVE 


FEVE 






Angiotensin II receptor type-2 (Angll2R, AT2) 


U20860.1 
Gl:747969 


RESMSCRKSSSLREMETFVS 


TFVS 






Apelin receptor (G protein-coupled receptor 
APJ, Angiotensin receptor-like 1, HG11) 


U03642.1 
Gl:425351 


GGEQMHEKSIPYSQETL\A/D 


LWD 






Blue-sensitive opsin (Blue cone photoreceptor 
pigment, Btuopsin) 


M13299.1 
Gl:1469901 


TCSSQKTEVSTVSSTQVGPN 


VGPN 






Bombesin receptor subtype-3 (BRS-3) 


L08893.1 
Gl:291876 


SEISVTSFTGCSVKQAEDRF 


EDRF 


X 




Bradykinin receptor B1 (B1 brady R, BK-1 
receptor, B1R) 


U22346.1 
Gl:727358 


SLAPISSSHRKEIFQLFWRN 


FWRN 






Bradykinin receptor B2 (B2BK2R, BK-2 
receptor, B2R) 


X86165.1 
Gl:1220160 


TSISVERQIHKLQDWAGSRQ 


GSRQ 


X 




Brain-specific angiogenesis inhibitor 1 precursor 
(BAM pre) 


AB005297.1 
Gl:2653431 


RSGATIPLVGQDIIDLQTEV 


QTEV 


X 


AA181 


Brain-specific angiogenesis inhibitor 2 precursor 
(BAI2pre. BAI2) 


AB005298.1 
Gl:3021698 


HRAAAWEPTEPPDGDFQTEV 


QTEV 


X 




Brain-specific angiogenesis inhibitor 3 precursor 
(BAI3pre. BAI3, KIAA0550) 


AB005299.1 
Gl:3021700 


WEKCLNLPLDVQEGDFQTEV 


QTEV 


X 




C3a anaphylatoxin chemotactic receptor 
(C3achemR, C3AR) 


U28488.1 
Gl:1 199577 


TRSTHCPSNNVISERNSTTV 


STTV 


X 




C5a anaphylatoxin chemotactic receptor (C5a- 
R, CD88. CD88 antigen) 


X58674.1 
Gl:29568 


ESKSFTRSTVDTMAQKTQAV 


TQAV 


X 


AA269 
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Calcitonin gene-related peptide type 1 receptor 
precursor (CGRPIRpre, CGRPtype 1 receptor. 
oMLwr\L, vorvfr^, uor\iA/. 


L76380.1 
61:1321593 


NGKSiHDIENVLLKPENLYN 


NLYN 






Calcitonin receptor precursor (CalcRpre, CT-R. 
CALCR, CALR). 


L00587.1 
Gl:1 79879 


QGEESAEIIPl>inEQESSA 


ESSA 


X 




Calcium-mobilizing lysophosphatidic acid 
receptor LP-A3/EDG-7 (EDG7. EDG7#2) 


AF1 86380.1 
Gl:6003655 


GSQYIEDSISQGAVCNKSTS 


KSTS 






Cannabinoid receptor 1 (CB1. CB-R. CANN6) 


X54937.1 
Gl:29914 


TVKIAKVTMSVSTDTSAEAL 


AEAL 


X 




Cannabinoid receptor 2 (CB2, CB-2, CX5) 


X74328.1 

GI.4U7oOD 


EADGKITPWPDSRDLDLSDC 


LSDC 


X 




C-C Chemokine binding protein 2 (Chemokine- 
binding. protein D6, C-C chemokine receptor D6, 

Vyiicniuivnie icccpiUi wV^r\-9, oVi^-onernuKiric 

receptor CCR10) 


U94888.1 
Gl:2213808 


LGERQSENYPNKEDVGNKSA 


NKSA 






w-Lf onemoKine reoepior o ^uror\o/ 


rv\DOt f ST**. 1 

Gl:21 04521 


PRr50QA0<^TQRPPDnHQTP\/ 
"^\Vawor^Vi^o i orxcc.r'L/no i cv 


O 1 CV 


X 




C-C Chemokine receptor 9A (CCR9A) 


AJ132337.1 
Gl:4886431 


EGSLKLSSMLLETTSGALSL 


ALSL 


X 




C-C Chemokine receptor type 1 (MIPIaR, C-C 
CKR-1. CCR-1, MIP.1alpha-R, RANTES-R. 
HM145, LD78 receptor) 


L09230.1 
Gl:179984 


LERVSSTSPSTGEHELSAGF 


SAGF 






C-C chemokine receptor type 10 (CC-CKR-10, 
CCR-10, G- protein coupled receptor 2) 


AF215981.1 
Gl:7546844 


RPRLSSCSAPTETHSLSWDN 


SWDN 






C-C chemokine receptor type 1 1 (CC-CKR-1 1 , 
CCR-1 1 , Chemokine receptor-like 1 , CCRL1 , 
CCX CKR) 


AF193507.1 
GI:73o3341 


VEEFPFDSEGPTEPTSTFSI 


TFSI 


X 




C-C chemokine receptor type 2 (C-C CKR-2, 
CCR-2, Monocyte chemoattractant protein 1 
receptor, MCP-1-R) 


U03882.1 
Gl. 472555 


GKGKSIGRAPEASLQDKEGA 


KEGA 


X 




C-C chemokine receptor type 3 (C-C CKR-3. 
L»v»r\-o, L/i\r\o, tosinopnii eoiaxin recepior^ 


U28694.1 


LERTSSVSPSTAEPELSIVF 


SIVF 


X 


AA43 


C-C chemokine receptor type 4 (C-C CKR-4, 
CCR-4, K5-5) 


AB023888.1 
Gl:6467134 


DTPSSSYTQSTMDHDLHDAL 


HDAL 


X 




CC Chemokine receptor type 5 (C-C CKR5, 
CCR5, HIV-1 fusion co-receptor, CHEMR13, 
CD1 95 antigen) 


X91492.1 
Gl:1262810 


ERASSVYTRSTGEQEISVGL 


SVGL 


X 


AA216 


C-C chemokine receptor type 6 (CC-CKR-6, 
CCR-6, LARC receptor, GPRCY4, Chemokine 
receptor-like 3, CKR-L3, DRY6) 


U45984.1 
Gl:2246432 


NISRQTSETADNDNASSFTM 


SFTM 






C-C chemokine receptor type 7 (CC-CKR-7, 
CCR-7, MlP-3 beta receptor, EBV-induced G 
protein-coupled receptor 1, EBI1, BLR2) 


L08176.1 
Gl: 183484 


RHIRRSSMSVEAETTTTFSP 


TFSP 






C-C chemokine receptor type 8 (CC-CKR-8, 
CCR-8, GPR-CY6, Chemokine receptor-like 1 . 
CKR-L1 , TER1 , CMKBRL2. CC- chemokine 
receptor CHEMR1) 


U45983.1 
Gl;2231165 


EKSSSCQQHSSRSSSVDYIL 


DYIL 


X 




Cell surface glycoprotein EMR1 precursor 
(EMRIpre, EMR1 homrione receptor). 


X81479.1 
Gl:784993 


SQSQTSRILLSSMPSASKTG 


SKTG 






Chemokine receptor-like 1 (G-protein coupled 
receptor DEZ, G protein- coupled receptor 
ChemR23) 


U79526.1 
Gl: 1732342 


TKMSSMNERTSMNERETGML 


TGML 






Chemokine receptor-iike 2 (IL8-related receptor 
DRY12, Flow-induced endothelial G protein- 
coupled receptor, FEG-1 , G protein-coupled 
recepior okKoU, \jnOK-DK^ 


Y08162.1 
Gl: 1707499 


LKAVIPDSTEQSDVRFSSAV 


SSAV 


X 




Cholecystokinin type A receptor (CCK-A 
receptor, CCKAR) 


L13605.1 
Gl:306490 


TGASLSRFSYSHMSASVPPQ 


VPPQ 






Corticotropin releasing factor receptor 1 
precursor (CRFRIpre, CRFR, CRF1, CRHR1, 


L23333.1 
Gl:408691 


SIPTSPTRVSFHSIKQSTAV 


STAV 


X 




Corticotropin releasing factor receptor 2 
precursor (CRFR 2, CRF2, Corticotropin- 
releasing hormone receptor 2, CRHR 2) 


U34587.1 
Gl: 1144507 


SIPTSPTRISFHSIKQTAAV 


TAAV 


X 




CX3C chemokine receptor 1 (CX3CR1, 
Fractalkine receptor, GPR13, V28, Beta 
chemokine receptor-like 1. CMKBLR1) 


U20350.1 
G!:665680 


SVLSSNF7YHTSDGDALLLL 


LLLL 


X 




C-X-C Chemokine receptor type 3 (CXCR-3, 
CKR-L2, CD183 antigen). 


X95876.1 
Gl:1552845 


SSSRRE>SSWSETSEASYSGL 


YSGL 


X 


AA124 
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C-X-C chemokine receptor type 4 (CXC-R4, 
Stromal cell- derived factor 1 receptor, SDF-1 
receptor, Fusin, Leukocyte-derived seven 
transmembrane domain receptor, LESTR, 
LCR1. FB22. NPYRL, HM89, CD184 antigen) 


L01639.1 
Gl:189313 


KRGGHSSVSTESESSSFHSS 


FHSS 






C-X-C chemokine receptor type 5 (CXCR-5, 
Burkitt'S lymphoma receptor 1 , BLR1 , 
Monocyte-derived receptor 15, MDR15) 


X68149.1 
Gl:29459 


PSWRRSSLSESENATSLTTF 


LTTF 


X 


AA45 


w-A-Lr cnemokine receptor type b (Uauk-o, o 
protein-coupled receptor bonzo, G protein- 


ArUU/040.1 
Gl:2253421 


r»K| CI^TETC A CLIKI\/C AXCM CO 1 

UiMoIn I ro/\orllNVCA\ I oMrUL 


n/lrUL 


X 




Dopamine receptor 1A (OopRecIA, DRD1) 


X55758.1 
Gi:288931 


DTDVSLEKIQPITQNGQHPT 


QHPT 






Dopamine receptor D2 (DRD2) 


IVIoUO^O.1 

Gl:181431 


MY 1 1 rNltrKIVArUMLnL* 


II ur^ 


X 




Dopamine receptor D3 (DopRecD3) 


Gl:927341 


VI Yl IrNltrKlNArLlvlLoO 


II 


X 




LJUpdiTlllit; reOcpiUI LJH yiJ\£\^) UDpailllfie 

receptor) 


Gl:291946 




RACC 


X 




D-1 B dopamine receptor, 05 dopamine 
receptor, Dibeta dopamine receptor) 


Gl:32048 


nnPHFI*?! DKITPFTPNGFH 


NGFH 






EGF-like module EMR2 (EiVIR2egf, EMR2) 


AF1 14491.1 
GI:6650688 


EMHTLSSSAKADTSKPSTVN 


STVN 






EGF-like module-containing mucin-like receptor 
EMR3 (EMR3) 


AF239764.1 
Gl:13183148 


GPDSKPSEGDVFPGQVKRKY 


KRKY 






Endothelial differentiation protein 1 (EDG-1 , G 
protein-coupled sphingoiipid receptor) 


M31210.1 
Gl:181948 


KDEGDNPETIMSSGNVNSSS 


NSSS 






Endothelial differentiation protein 4 
(Lysophosphatidic acid G protein-coupled 
receptor 4, Endothelial differentiation 
lysophosphatidic acid G-protein-coupled 
receptor 4, EDG4) 


AF233092.1 
Gl:7243675 


GASTRIMLPENGHPLMDSTL 


DSTL 


X 




Endothelial differentiation protein 5 


AF034780.1 


LERGMHMPTSPTFLEGNTW 


NTW 


X 




Endothelin B receptor precursor (ET-B, 
Endothelin receptor Non-selective type) 


M74921.1 
Gl:182275 


FKANDHGYDNFRSSNKYSSS 


YSSS 






cnuuinciin d reuepior-iirvc piuicin-^c piciAJiour 
(EndoBRp2. ETBR-LP-2. ETBRLP2) 


T lO^OU. 1 

Gl:3059117 


<^IYFHKPRFQPPI 1 PI rSTPP 




X 




Endothelin-1 receptor precursor (ET-A, 

EndlRpre). 


D90348.1 
Gl:219649 


KNHDQNNHNTDRSSHKDSMN 


DSMN 






ETL protein (EGF-TM7-latrophilin-retated 
protein, ETL) 


AF1 92403.1 
Gl:1 1225482 


IQEEYYRLFKNVPCCFGCLR 


GCLR 






Extracellular calcium-sensing receptor 
precursor (CalRec, CASR, Parathyroid Cell 
calcium-sensing receptor, GPRC2A, PCAR1). 


X81086.1 

Gl:599819 


SQSFVISGGGSWTENWNS 


NA/NS 






fMet-Leu-Phe receptor (fMLP receptor, N- 
formyl peptide receptor, FPR, N-fbnmylpeptide 
chemoattractant receptor) 


M37128.1 
G!:189183 


TSDTATNSTLPSAEVELQAK 


LQAK 






FMLP-related receptor 1 (FMLP-R-I, LIpoxin A4 
receotor LXA4 receotor RFP HM63^ 


M76672.1 
Gl- 182666 


TNDTAANSASPPAETELQAM 


LOAM 






FMLP-related receptor II (FMLP-R-I 1, 
FMPLreIR) 


M76673.1 
Gl:182668 


TSNTHTTSASPPEETELQAM 


LOAM 






Follicle stimulating hormone receptor precursor 
fFSH-R FollitroDin receotor^ 


M65085.1 
Gl: 182770 


PRVTNGSTYILVPLSHLAQN 


LAQN 






Frizzled 1 precursor (Fzdl pre, Frizzled-1, Fz-1, 
hFz1,FzE1,FZD1). 


AF072872.1 
Gl:5305406 


NSWRKFYTRLTNSKQGETTV 


ETTV 


X 




Frizzled 10 precursor (FzdIOpre, Frizzled-1 0, 
Fz-10, hFzIO, FzE7, FZD10) 


AB027464.1 
Gl:5834487 


HPQKTHHGKYEIPAQSPTCV 


PTCV 


X 




Frizzled 2 precursor (Fzd2pre, Friz2led-2, Fz-2, 
hFz2, FzE2, FZD2). 


L37882.1 
Gl:736678 


HSWRKFYTRLTNSRHGETTV 


ETTV 


X 




Frizzled 3 precursor (FzdSpre, Frizzled-3, Fz-3, 
hFz3, FZD3). 


AB039723.1 
Gl:7670051 


THITHGTSMNRVIEEDGTSA 


GTSA 


X 




Frizzled 4 precursor (Fzd4pre, Friz2led-4, Fz-4, 
hFz4. FzE4. FZD4). 


AB032417.1 
Gl:6277285 


KREKRGNGWVKPGKGSETW 


ETW 


X 




Frizzled 5 precursor (Fzd5pre, Frizz!ed-5, Fz-5, 
hFz5, FzE5, FZD5, HFZ5). 


U43318.1 
Gl:11 51251 


RTGPPGPAATYHKQVSLSHV 


LSHV 


X 
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Frizzled 6 precursor <Fzd6pre, Frizzled-6. Fz-6, 
hFz6, FZD6)). 


AB012911.1 
Gl:3062802 


LVHPVSGVRKEQGGGCHSDT 


HSDT 






Frizzled 8 precursor (FzdSpre, Frizzted-8, Fz-8, 
hFz8 FZD8) 


AB043703.1 
61-13623798 

Wl* 1 wwfc w 9 wW 


WRSGTASSVSYPKQMPLSQV 


LSQV 


X 




Frizzled 9 precursor (Fzd9pre, Frlzz!ed-9, Fz-9, 
hFz9, FzE6, FZD9). 


U82169,1 
Gl:1906597 


PTWLHMTKTDPSLENPTHL 


PTHL 


X 




Galanin receptor type 1 (GAL1-R, GALR1) 


L34339.1 
Gl:559047 


DTKENKSRIDTPPSTNCTHV 


CTHV 


A 




Galanin receptor type 2 {GAL2-R, GALR2) 


AF040630.1 
Gl:2921759 


PGPSWQGPKAGDSILWDVA 


VDVA 


X 




Galanin receptor type 3 {GAL3-R, GALR3). 


AF073799.1 
Gl:3608409 


QGPEPREGPVHGGEAARGPE 


RGPE 






Gamma-aminobutyric acid type B receptor, 
subunit 1 precursor (GABAB1 pre, GABA-B-R1, 
Gb1. GABBR1, GABA-B receptor 1, GBR1) 


AJ225028.1 
Gl:3892593 


PPEPPDRLSCDGSRVHLLYK 


LLYK 






v3ciii 11 1 Id ai 1 III iwiJUiy 1 <o d^^ivj \y\JK: o ixs\^\jwji^ 

subunit 2 precursor (GABABR2p, GABA-B 
receptor 2, GBR2, GABBR2, GABA-B-R2, Gb2, 
G protein-coupled receptor 51 , GPR51 , HG20). 


AJ012188 1 
Gl:3776097 


TASPRHRHVPPSFRVMVSGL 

1 f^K^M 1 Al II XI i V 1^ I ^^1 t X V IVI V v^v^^ 


VSGL 


V 

A 




Gastric inhibitory polypeptide receptor precursor 
(GIPRpre, GIPR, Glucose-dependent 


U392o1.1 
Gl:1066050 


ooo 1 LPCjPoNtAoKtLto T U 




X 




Gastrin/cholecystokinln type 8 receptor (CCK-B 


L08112.1 
Gr306488 


PSIASLSRLSYTTISTLGPG 


LGPG 






Gastrin-releasing peptide receptor (GRP-R, 
GRP-preferring bombesin receptor) 


M73481.1 
Gl:183649 


NPSVATFSLINGNICHERYV 


ERYV 


X 




GHRH receptor splice variant 1 (GHRHRspl) 


AF282259.1 
Gl:10242291 


TRAKWTTPSRSAAKVLTSMC 


TSMC 


X 




glucagon receptor precursor (GlucagRp, GL-R, 
GCGR) 


U03469.1 
Gl:439689 


DSSAETPLAGGLPRLAESPF 


ESPF 


X 




Glucagon-like peptide 1 receptor precursor 
(GLPIRpre, GLP-1 receptor, GLP-1-R) 


U01 104.1 
Gl:406081 


SSGATAGSSMYTATCQASCS 


ASCS 






Glucagon-like peptide 2 receptor precursor 
(GLP2Rpre. GLP-2 receptor, GLP-2-R, 
GLP2R). 


AF1 05367.1 
Gl:4324490 


SEGDVTMANTMEEILEESEI 


ESEI 


X 




Gonadotropin-releasing hormone receptor 


L03380.1 
Gl:183421 


FLFAFLNPCFDPLIYGYFSL 


YFSL 


X 




G-protein coupled receptor 91 (GPCR91) 


AF348078.1 
Gl:13517982 


KSLTSFSRWAHELLLSFREK 


FREK 






G-protein coupled receptor EDG-7 (EDG7) 


AF236117.1 
Gl:9651838 


GSQYKEDSSSQG7VCNKNSS 


KNSS 






G-protein-coupled receptor 74 (GPCR74) 


AF236083.1 
Gl:14279164 


QNPHGETLLYRKSAENPNRN 


PNRN 






Green-sensitive opsin (Green cone 
photoreceptor pigment) 


Ml 3306.1 
Gl:180688 


SELSSASKTEVSSVSSVSPA 


VSPA 


y 




Growth hormone secretagogue receptor type 1 
(GHS-R, GH-releasing peptide receptor, 
GHRP, Ghrelin receptor) 


U60 179.1 
Gl:1504140 


KLSTLKDESSRAWTESSINT 


SINT 






Growth honmone-releasing hormone receptor 
precursor (GHRHRpre, GRFreceptor, GRFR). 


L01406.1 
Gl:183172 


TRAKWTTPSRSAAKVLTSMC 


TSMC 


X 




Histamine HI receptor (HistHIR) 


Z34897.1 
Gl:510295 


YPLCNENFKKTFKRILHIRS 


HIRS 






Histamine H2 receptor (HistH2R, H2R, Gastric 
receptor 1) 


M64799.1 
Gl:1 84087 


LKLQVWSGTEVTAPQGATDR 


ATDR 






Histamine H3 receptor (HH3R, G protein- 
coupled receptor 97) 


AF1 40538.1 
Gl:5031290 


LLCPQKLKIQPnooLcnCVm 


nCWiv 






Histamine H4 receptor (HH4R, GPRv53, G 
protein-coupled receptor 105, GPGR105. 


AB044934.1 
Gl:1 0241 846 


KIFCIKKQPLPSQHSRSVSS 


SVSS 






HOR5'beta13 (HOR5'b13) 


AAG41 677.1 
Gl:11908212 


HKFMSLCTSNALPNYLFHQD 


FHQD 






HOR5*beta5 (HOR5'b5) 


AAG41683.1 
Gl:11908218 


KTKQIQNAILHLFTTHRIGT 


RIGT 






HOR5'beta6 {HOR5'b6) 


AAG41682.1 
Gl:11908217 


KTKQIQSGILRLFSLPHSRA 


HSRA 


X 




HOR5'beta7 (HOR5'b7) 


AAG41681.1 
Gl:1 1908216 


KTKEIHRAIIKLLGLKKASK 


KASK 
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HORS'betaS (HOR5'b8) 


AAG41680.1 
Gl:1 1908215 


KTKEIHGAIVRMLLEKRRRV 


RRRV 


X 




human TA2R, beta isoform (TA2Rbiso. 
TBXA2R) 


AAC24302.1 
Gl:3253117 


AGVQLLPFEPPTGKALSRKD 


SRKD 






InterleukirvS receptor A (IL8RA, high affinity IL-8 

receptor A, iL-8 receptor type 1 , CXCR-1 , 
CDw128a) 


AAB59436.1 

Gl:559050 


LARHRVTSYTSSSVNVSSNL 


SSNL 


X 


AA293 


lnterleui^in-8 receptor B (IL8RB, high affinity IL-8 
receptor B, CXCR-2, GRO/MGSA receptor. 
CDw128b) 


M73969.1 
Gl:186516 


PKDSRPSFVGSSSGHTSTTL 


STTL 


X 


AA29.2 


KIAA0821 protein. 


AB020628.1 


PGLEGPGPDGDGQMQLVTSL 


VTSL 


X 




Latrophilin-2 (LPHH1, LEC1. LATROPH2) 


AJ131581.1 


EGCIPEGDVREGQMQLVTSL 


VTSL 


X 




Lectomedin-1 alpha (LECIalph. LEC1) 


AF1 04266.1. 

Ol.000U<t09 


GLRAHLQDLYHLELLLGQIA 


GQIA 


X 




Lectomedin-1 beta (LECIbeta. LEC1) 


API 04938.1 
Gl;5oo0491 


VKASTTRTSARYSSGTQDIH 


QDIH 






Lectomedin-2 (LEC2) 


AF307079.1 

Gi:n037013 


PGLEGPGPDGDGQMQLVTSL 


VTSL 


X 




Lectonriedin-3 (Lecmed3, LEC3) 


AF307080.1 
Gl:1 1037015 


IGASEQCQGYKCHGYSTTEW 


TTEW 


X 




Leucocyte antigen CD97 precursor (CD97pre, 
CD97) 


X84700.1 
Gl:840770 


TTSGTGHNQTRALRASESGI 


ESGI 


X 




Leukotriene 84 receptor 2 (BLTR2, Seven 
transmembrane receptor BLTR2) 


AJ278605.1 
Gl:8919627 


GRGNGDPGGGMEKDGPEWDL 


EWDL 






Luteinising hormone-choriogonadotropin 
receptor (Luteinizing hormone receptor) 


X84753.1 
Gl: 1225983 


LSTLHCQGTALLDKTRYTEC 


YTEC 


X 




Lysophosphatidic acid receptor (EDG-2). 


U80811.1 

^^t.A A'^A 

Gill 857424 


ASSLNHTILAGVHSNDHSW 


HSW 


X 




Lysosphingolipid receptor (EDG-3). 


X83864.1 
GI: 1770395 


DPSSCIMDKNAALQNGIFCN 


IFCN 






Melanocortin-3 receptor (i\/IC3-R) 


LOS 155.1 
Gl:188673 


LELRNTFKblLCGONCjMNLC? 


MNLG 






Melanocortin-4 receptor (MC4-R) 


L08603.1 
Gl:291977 


FKEIICCYPLGGLCDLSSRY 


SSRY 


X 




Melanocortin-5 receptor (MC5-R, MC-2) 


Z25470.1 
Gl:939924 


FKEIICCRGFRIACSFPRRD 


PRRD 






Melanocyte stimulating hormone receptor 
(MSH-R, Melanotropin receptor, Melanocortin-1 
recepior, mv^i-k^ 


X65634.1 
Gl:34790 


Y AF nSQELRRTLKt VLTCSW 


TCSW 






Melatonin receptor type 1 A (Mel-1 A-R) 


U14108.1 
Gl:602129 


VKNAflCPSPLMTNNNWKVDSV 


VDSV 


X 




Melatonin receptor type 1 B (Met-1 B-R) 


U25341.1 
Gl:971193 


EGLQSPAPPIIGVQHQADAL 


ADAL 


X 




Melatonin-related receptor (H9. GPR50) 


U52219.1 
Gl:1326154 


NDYHDWWDVEDDPDEMAV 


EMAV 


X 




Metabotropicgiutamate receptor 1 precursor 
(GluRlpre, GRM1, GPRC1A, mGluRI) 


U31215.1 
GI.945096 


PNVSYASVILRDYKQSSSTL 


SSTL 


X 




Metabotropicgiutamate receptor 2 precursor 
(GluR2pre, MGR2, mGluR2, GRM2, GPRC1B) 


L35318.1 
Gl:999415 


QFVPTVCNGREWDSTTSSL 


TSSL 


X 




Metabotropicgiutamate receptor 3 precursor 


X77748.1 
Gl' 11 71 56*^ 

I.I If 1 


TYVPTVCNGREVLDSTTSSL 


TSSL 


X 




Metabotropicgiutamate receptor 4 precursor 


X80818.1 

\j 1 . 1 1 uv 1 0£ 


LEAPALATKQTYVTYTNHAI 


NhlAi 






Metabotropicgiutamate receptor 5 precursor 


D28538.1 
01*14080*51 


SSPKYDTLIIRDYTQSSSSL 


SSSL 


X 




Metabotropic glutamate receptor 6 precursor 

^olUKopre, oKmo, oKKOlr, molUKD^ 


U82083.1 


LKATSTVAAPPKGEDAEAHK 


EAHK 






Metabotropic glutamate receptor 7 precursor 
fGluR7nre GRM7 GPRC1G mGluR7\ 


X94552.1 
Gl- 1370110 


VDPNSPAAKKKYVSYNNLVI 


NLVI 


X 


AA114 


Metabotropicgiutamate receptor 8 precursor 
(GluR8pre. GRM8. GPRC1H. mGluRd) 


U92459.1 
Gl:1935042 


LETNTSSTKTTYISYSNHSI 


NHSt 






Motilin receptor (G protein-coupled receptor 
GPR38) 


AF034632.1 
Gl:2654158 


DTGGDTVGYTETSANVKTMG 


KTMG 






Muscarinic acetylcholine receptor M1 
(AchRmI) 


X52068.1 
Gl:34450 


RWRKIPKRPGSVHRTPSRQC 


SRQG 


X 




Muscarinic acetylcholine receptor M2 


M16404.1 
Gl:177989 


FKKTFKHLLMCHYKNIGATR 


GATR 
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Muscarinic acetylcholine receptor M3 
(AchRM3) 


X152e6.1 
Gl:32323 


QQYQQRQSVIFHKRAPEQAL 


EQAL 


X 


AA252 


Muscarinic acetylcholine receptor M4 
(AchRM4) 


M16405.1 
Gl:177991 


FKKTFRHLLLCQYRNIGTAR 


GTAR 






Muscarinic acetylcholine receptor M5 


M80333.1 
GI:177987 


RWKKKKVEEKLYWQGNSKLP 


SKLP 






Neuromedin K receptor (NKR, Neurokinin B 
receptor, NK-3 receptor, NK-3R) 


S86392.1 
Gl:246908 


SASATSSFISSPYTSVDEYS 


DEYS 






Neuromedin K receptor (NKR, Neurokinin B 
receptor, NK-4 receptor. NK-4R. K1R. 
Neurokinin 4 receptor, NK4) 


M84605.1 
Gl:189391 


STSTTASFVSSSHMSVEEGS 


EEGS 






Neuromedin U receptor 1 (NMUR1) 


AF272362.1 
01:10946200 


WVHPl^GNDGPEAQQETDPS 


TOPS 






Neuromedin U receptor 2 (NeUR2, Neuromedin 
U receptor-type 2, G protein-coupled receptor 
TGR-1) 


AF272363.1 
Gl:10946202 


ALSSEQMSRTNYQSFHFNKT 


FNKT 






Neuromedin^B receptor (NMB-R. Neuromedin- 
B-prefemng bombesin receptor) 


M73482.1 
Gl:189241 


NMVTNSVLLNGHSMKQEMAM 


EMAM 






Neuropeptide FF receptor 1 (NepepFFI , RF 
amide-related peptide receptor OT7T022) 


AB040104.1 
Gl:11 125701 


LPREGPGCSHLPLTIPAWDI 


AWDI 


X 




Neuropeptide FF receptor 2 (Neuropeptide G 
protein-coupled receptor, G-protein-coupled 
receptor HLWAR77) 


AF1 19815.1 
Gl:4530468 


KPQQELVMEELKETTNSSEI 


SSEI 


X 




Neuropeptide Y receptor type 1 (NepepYRI , 
NPY1-R) 


M88461.1 
Gl:189155 


KQASPVAFKKINNNDDNEKI 


NEKI 


X 




Neuropeptide Y receptor type 2 (NPY2-R, NPY- 
Y2 receptor, NepepYR2) 


U36269.1 
Gl:1063633 


NLEVRKNSGPNDSFTEATNV 


ATNV 


X 




Neuropeptide Y receptor type 4 {NPY4-R, 
Panaeatic polypeptide receptor 1, PP1) 


U35232.1 
Gl:1063629 


TS/HTEVSKGSLRLSGRSNPI 


SNPI 






Neuropeptide Y receptor type 5 (NPY5-R, NPY- 
Y5 receptor, Y5 receptor, NPYY5) 


U56079.1 
Gl:1438903 


GFLNNGIKADLVSLIHCLHM 


CLHM 






Neurotensin receptor type 1 (NT-R-1 , High- 
affinity levocabastine- insensitive neurotensin 
receptor, NTRH) 


X70070.1 
Gl:35020 


ADSVSSNHTLSSNATRETLY 


ETLY 


X 




Neurotensin receptor type 2 (NT-R-2, 
Levocabastine-sensitive neurotensin receptor, 
NTR2 receptor) 


Y10148.1 
Gl:3901027 


QSPTLMDTASGFGDPPETRT 


ETRT 






Ocular albinism type 1 protein (OculAlbl, OA1) 


Z48804.1 
Gl:886873 


ASESCNKNEGDPALPTHGDL 


HGDL 


X 




Odorant receptor HOR3'beta1 (HOR3'b1) 


AAG42364.1 
GI:11991oo3 


SVKTQQIHTRMLRLFSLKRY 


LKRY 


X 




Odorant receptor HOR3'beta3 (HOR3'b3) 


AAG42366.1 


KIKEIRNSWLTLSRKRGEF 


RGEF 


X 




Odorant receptor HOR3'beta5 (HOR3'b5) 


AAG42368.1 

Oi.l 1^79 lOO/ 


VKTKQIRDHIVKVFFFKKVT 


KKVT 






Olfactory receptor 10A4 (OlfR10A4, HP2, 
olfactory-like receptor protein JCG5) 


AF209506.1 
Gl: 1701 6309 


KEVKAALKRLIHRTLGSQKL 


SQKL 






Olfactory receptor 10A5 (OlfR10A5, HP3, 
ruiaiive lasie recepior jooo^- 


AAG45206.1 


VKNALSRTFHKVLALRNCIP 


NCIP 






Olfectory receptor 10H1 (OlfRIOHl) 


AAC08454.1 
Gl:2996652 


KVAMKKTFFSKLYPEKNVMM 


NVMM 






Olfactory receptor 10H2 (OlfR10H2) 


AAC14388.1 


KELKVAMKRTFLSTLYSSGT 


SSGT 






Olfactory receptor 10J1 (OlfRI 0J1 , Olfactory 

icuepiui-iii\c pruiciii noivinur %/) 


X64995.1 


TLRNKEVKDALCRAVGGKFS 


GKFS 






Olfactory receptor 1 1A1 (OlfRI 1A1. Hs6M1-18) 


AJ302614.1 
Gl:12054452 ^ 


KEVHQALRKILCIKQTETLD 


ETLD 






Olfactory receptor 12D3 (OlfRI 2D3, Hs6M1-27) 


CAB65796.1 
Gl:6691936 


MMALKKIFGRKLFKDWQQHH 


QQHH 






Olfactory receptor 1A1 (OIfR1A1, Olfactory 
receptor 17-7, OR17-7). 


AF087918.1 
Gl:7144622 


LRNRDMKAALRKLFNKRISS 


RISS 






Olfactory receptor 1A2 (OlfRI A2, Olfactory 
receptor 17-6, OR17-6) 


AF1 55225.1 
Gl:5081803 


LRNWDMKAALQKLFSKRISS 


RISS 






Olfactory receptor 1 D2 (Olfactory receptor-like 

protein HGMP07E, Olfactory receptor 17-4, 
OR17>4) 


X65857.1 

01:425220 


NKDMHGALGRLLDKHFKRLT 


KRLT 






Olfactory receptor 1 D4 (OlfRI D4, Olfactory 
receptor 17-30, OR17'30). 


AF087922.1 
Gl:71 44627 


NKDMHGAPGRVLWRPFQRPK 


QRPK 
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Olfactory receptor 1E1 (OlfRIEI, Olfactory 
reoepior-iiK6 proiein noivir'U/i; 


X64994.1 


RDMKGALSRVIHQKKTFFSL 


FFSL 


X 




Olfactory receptor 1E2 (OlfR1E2, Olfactory 
receptor 17-93/17-135, OR17-93) 


AF087925.1 
Gl:7144633 


RDMKGALERVICKRKNPFLL 


PFLL 


X 




Olfactory receptor 1F1 (OlfRIFI, Olfactory 
receptor 16-35, OR16-35). 


Y14442.1 
Gl:2370144 


RNRYLXGALKKWGRWFSV 


VFSV 


X 




Olfactory receptor 1G1 (OlfRIGI, Olfactory 
receptor 17-209, OR1 7-209) 


AF087928.1 
Gl:7144638 


NQEIKSSLRKLIWVRKIHSP 


IHSP 






Olfactory receptor 1 11 (OlfRI 11 , Olfactory 
receptor 19-20, OR19-20) 


AAC1 8915.1 
Gl:31 84262 


MHPIPYPGGVQSLAGNRDME 


ROME 






Olfactory receptor 2A4 (OlfR2A4) 


AAD05193.1 
Gl:41 59884 


LRNSEVKNTLKRVLGVERAL 


ERAL 


X 




Olfactory receptor 2AG1 (OlfR2AG1, HT3) 


Q9H205 
Gl: 14423804 


VMRALRRVLGKYMLPAHSTL 


HSTL 


X 




Olfactory receptor 2B2 (OlfR2B2. Olfactory 
receptor 6-1, OR6-1, Hs6M1-10) 


AJ302584.1 
Gl: 12054392 


CPIFVITIENYCNLPQRKFP 


RKFP 






Olfactory receptor 2B3 (OlfR2B3, Olfactory 
receptor 6-4, OR6-4, Hs6M1-1) 


CAA1 8782.1 
Gl:3757726 


NKDMKEAFKRLMPRIFFCKK 


FCKK 






Olfactory receptor 286 (OlfR2B6, Hs6M1-32, 
vjiiaciory receptor o-oi , ljkd-oi 


CAC14158.1 


NKEVKEGFKRLVARVFLIKK 


LIKK 






Olfactory receptor 2C1 (OlfR2C1, OLFmf3). 


AF098664.1 


RNMEVKGALRRLLGKGREVG 


REVG 






Olfactory receptor 2D2 (OlfR2D2, Olfactory 
receptor 1 1-610, OR1 1-610, HB2) 


AAG45204.1 
Gl: 12007434 


SLRNKDVKAALRKVATRNFP 


RNFP 






Olfactory receptor 2F1 (OlfR2F1, Olfactory 


U56421.1 


KGAWQKLLWKFSGLTSKLAT 


KLAT 






Olfactory receptor 2F2 (OlfR2F2, Olfactory 


AAC643781 


KGAWHKLLEKFSGLTSKLGT 


KLGT 






Olfactory receptor 2H1 (OR2H1 , OlfR2H1 , 
Hs6M1-16, Olfactory receptor 6-2, OR6-2) 


AJ302604.1 
Gl: 12054432 


RALRRLLGKERDSRESWRAA 


WRAA 


X 




Olfactory receptor 2H3 (OlfR2H3, Olfactory 
receptor-like protein FAT1 1 ) 


L35475.1 
Gl:1041044 


RAFRRLLGKERDSRESWRAA 


WRAA 


X 




Olfactory receptor 2J2 (Olfactory receptor 6-8, 
OR6-8, HS6M1-6) 


AJ302571.1 
Gl:12054366 


LRNKHVKGAAKRLLGWEWGK 


EWGK 






Olfactory receptor 2J3 (OlfR2J3, Olfactory 
receptor 6-6, OR6-6. Hs6M1-3). 


CAA18783.1 
Gl:3757727 


lYTLRNKNA/RGAVKRLMGWE 


MGWE 






Olfactory receptor 2T1 (OIfR2T1, OR2T1, 
Olfactory receptor 1-25, OR1-25) 


XM_060316.1 
Gl:17437062 


ALKRALGRFKGPQRVSGGVF 


GGVF 


X 




Olfactory receptor 2W1 (OlfR2W1, Hs6M1-15). 


CAB42853.1 
01:4826521 


LKKLMRFHHKSTKIKRNCKS 


NCKS 






Olfactory receptor 3A1 (OlfR3A1 , Olfactory 
receptor 17-40, OR17-40) 


X80391.1 
Gl:516319 


RNPDVQSAIWRMLTGRRSLA 


RSLA 


X 




Olfactory receptor 3A2 (OlfR3A2. Olfactory 
receptor i /^-^^o, vjki /-^b, ukoia^, ulpKAvW) 


AF087930.1 
ol.7144o41 


RNPDVQGALWQIFLGRRSLT 


RSLT 






Olfactory receptor 3A3 (OlfR3A3. Olfactory 
receptor 17-201 , OR1 7-201) 


AF087926.1 
Gl:7144635 


RNTDVQGALCQLLVGERSLT 


RSLT 






Olfactory receptor 4F3 (OlfR4F3) 


AAD05195.1 
01:4159886 


EMKAAIKRVCKQLVIYKRIS 


KRIS 






Olfactory receptor 5182 (HOR5'b3, 
HOR5'beta3, OR51B2) 


AAD29425.2 
Gl:11908208 


KTKQIQYGIIRLLSKHRFSR 


RFSR 






Olfactory receptor 51 84 (HOR5'b1 , 
HORS'betal. OXB4, OR51B4) 


AAD29426.2 
01:11908209 


IKTKQIQRSIIRLFSGQSRA 


QSRA 


X 




Olfactory receptor 51 E2 (OlfR51E2, Prostate 
specific G-protein coupled receptor, OXE2, 
HPRAJ, OR51E2, PSGR) 


AF31 1306.1 
Gl:1 1875777 


RVLAMFKISCDKDLQAVGGK 


VGGK 






Olfactory receptor 51 11 (HOR5'b1 1 , 
HOR5'beta11.0R51l1). 


AAG41679.1 
01:11908214 


SVKTKEIRKGILKFFHKSQA 


KSQA 


X 




Olfactory receptor 5112 (HOR5'b12. 
HOR5'beta12, OR51I2) 


AAG41678.1 
Gl:11908213 


SAKTKEIRRAIFRMFHHIKI 


HIKI 


X 




Olfactory receptor 52A1(HOR3'b4, HPFH10R. 
nOR3 Deta4, OR52A1 ) 


AAG42367.1 
GI:1199186o 


LVYGAKTTQIRIHWKMFCS 


MFCS 






Olfactory receptor 52D1 (HOR5'b14, 
HOR5'beta14, OR52D1) 


AAG41676.1 
Gl:1 1908211 


RTKEIRSRLLKLLHLGKTSI 


KTSI 


X 




Olfactory receptor 5F1 (OlfRSFI, Olfactory 
receptor 11-10. OR11-10) 


095221 
Gl:14423782 


KEVKKALANVISRKRTSSFL 


SSFL 


X 




Olfactory receptor 511 (OlfR5l1. Olfactory 
receptor-like protein 0LF1) 


U56420.1 
Gl:1336040 


RNKDVKDAAEKVLRSKVDSS 


VDSS 






Olfactory receptor 5U1 (OlfRSUI, Hs6M1-28). 


XM 167134.2 
Gi:22059864 


MLSKEELPQRKMCLKAMFKL 


MFKL 


X 
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Olfactory receptor 5V1 (OlfR5V1. Hs6M1-21). 


CAB65797.1 
C3l.6o9i9o7 


KTIGSKWQPPISSLDSKLTY 


KLTY 


X 




Olfectory receptor 6A1 (OlfR6A1, Olfactory 
receptor n-oo, ORi i-oo) 


AF06587ai 
oi.OOOlDlO 


CILHLYQHQDPDPKKGSRNV 


SRNV 


X 




Olfactory receptor 6B1 (OlfR6B1 , Olfactory 
receptor 7-3, OR7.3, OR6B1) 


AAC64377.1 
Gl:3766132 


NREVKEALKKLAYCXaASRSD 


SRSD 






Olfactory receptor 7A10 (OlfR7A10. OST027) 


AAC25627.1 
Gl:3290001 


YSLRNKHIKGAMKTFFRGKQ 


RGKQ 






Olfactory receptor 7A17 (OlfR7A17) 


AAB82060.1 
G!:2447219 


YSLRNKDIKRALKMSFRGKQ 


RGKQ 






Olfactory receptor 7A5 (OlfR7A5, Olfactory 
receptor TPCR92). 


Y1 0530.1 
Gl:2792017 


ALGIHLLWGTMKGQFFKKCP 


KKCP 






Olfactory receptor 7C1 (OlfR7C1. Olfactory 
receptor TPCR86). 


AAC25626.1 
GI.32o9999 


LGRLLSRATFFNGDITAGLS 


AGLS 






Olfactory receptor 7C2 (OlfR7C2, Olfactory 
receptor i9-io, ORi9-io) 


AAC1 5755.1 
Gi.310o02o 


LGRLLLRATSLXEGTIAKLS 


AKLS 






Olfactory receptor 89 {OlfR89) 


AJ132194.1 
Gl:41 60227 


NVKGALRNLVRSISALSDSG 


SDSG 


X 




Olfactory receptor 888 (OlfR8B8, Olfactory 
receptor TPCR85, olfactory-like receptor JCG8) 


AF238488.1 
Gl:1 701631 8 


LRNKDVKVALKKILNKNAFS 


NAFS 






Olfactory receptor 8D2 (OlfR8D2, Olfactory 
receptor-like protein JCG2) 


AF162668.1 
Gl: 12002781 


LRNKDVKNALKKMTRGRQSS 


ROSS 






Olfactory receptor H17 (OIfRHI7) 


AAG45208.1 
Gl:12007438 


CTLHLYQHQDPDPKKASRNV 


SRNV 


X 




Opioid receptor mu 1 (m1 OpioiR) 


CAC1 5482.1 
Gl:1 1128469 


RDHPSTANTVDRTNHQVRSL 


VRSL 


X 




Opioid receptor type delta (d1 OpioiR, DOR-1) 


U1 0504.1 
Gl:501144 


ARERVTACTPSDGPGGGAAA 


GAAA 






Opioid receptor type kappa (k1 OpioiR, KOR-1) 


U1 1053.1 
Gl:532059 


RNTVQDPAYLRDlDGMNKPV 


NKPV 


X 




Opioid receptor type kappa 3 (k30pioiR, 
Nociceptin receptor, (Orphanin FQ receptor, 
Kappa-iype o opioio recepior, j\\jt\-o) 


X77130.1 
Gl:471316 


SIAKDVALACKTSETvPRPA 


PRPA 


X 




Opioid receptor type mu (mOpioIR, MOR-1 ) 


AAA20580.1 


TVDRTNHQLENLEAETAPLP 


APLP 






Opsin 3 (Encephalopsin, Panopsin) 


AF140242.1 


VDDSDKTNGSKVDVIQVRPL 


VRPL 


X 




Opsin 4 (Melanopsin) 


AF147788.1 

Vjl.DOyO/UU 


HEAETPGKTKGLIPSQDPRM 


DPRM 






Orexin receptor type 1 (Ox1r, Hypocretin 


AF041 243.1 
rii -90071 o'a 


CSISKISEHWLTSVnVLP 


TVLP 






Orexin receptor type 2 (Ox2r. OX2R, Hypocretin 
receptor type 2) 


AF041 245.1 


VLTSISTLPAANGAGPLQNW 


LQNW 






Oxytocin receptor (OT-R, OxytocR) 


X64878.1 


SFVLSHRSSSQRSCSQPSTA 


PSTA 


X 




P2Y purinoceptor 1 (P2Y1 R, ATPreceptor, 
T 1 , r^unricrgic recepior^ 


Z49205.1 


SEDMTLNILPEFKQNGDTSL 


DTSL 


X 


AA330 


P2Y purinoceptor 10 (P2Y10. P2Y-llke receptor) 


AF000545.1 
Gl:21 04786 


GSSVTRSRLMSKESGSSMIG 


SMIG 






P2Y purinoceptor 1 1 (P2Y1 1) 


AJ298334.1 
Gl:12964589 


PLNATAAPKPSEPQSRELSQ 


ELSQ 


X 




P2Y purinoceptor 2 (P2Y2, P2U purinoceptor 1, 
P2U1 , ATP receptor, Purinergic receptor) 


U07225.1 
Gl:984506 


DFRRTESTPAGSENTKDIRL 


DIRL 


X 




P2Y purinoceptor 4 (P2Y4, Uridine nucleotide 
receptor, UNR, P2P) 


X91 852.1 
Gl: 1124904 


CRWAATPQDSSCSTPRADRL 


ADRL 


X 




P2Y purinoceptor 5 (P2Y5, Purinergic receptor 
5, RB intron encoded G-protein coupled 
receptor) 


AF000546.1 
Gl:2232068 


FIQHNLQTLKSKIFDNESAA 


ESAA 


X 




purinooepior i [y^i i ^ LeuKOiriene ty\ 
receptor, Chemoattractant receptor-like 1) 


Gl:1469913 




r>icLiN 






P?Y ni irinopf»nfnr Q Piirinprnip rpr!Pntnr 
■ ^ 1 |>^ui II ? \t ^ 1 ^r\, r uiiiici^iv^ ic^^s^iwi 

9. GPCR GPR23, P2Y5-like receptor) 


Gl: 17531 GO 


FFVSDOTTNNGRELML ESTF 


ESTF 


X 




Parathyroid hormone receptor precursor 
(PTH2Rpre. PTH2 receptor, PTHR2) 


U25128.1 
Gi:887966 


RPMESNPDTEGCQGETEDVL 


EDVL 


X 


AA268 


Parathyroid hormone/ parathyroid hormone- 
related peptide receptor precursor (PTHRpre, 
PTH/PTHR receptor. PTHR1, PTHR, PTRR) 


L04308.1 
Gl:190721 


EASGPERPPALLQEEWETVM 


ETVM 


X 




Peropstn (Visual pigment-like receptor 
peropsin) 


AF012270.1 
Gl:2307009 


PVTSILPMDVSQNPLASGRI 


SGRI 


X 





148 



Pituitary adenylate cyclase activating 
polypeptide type 1 receptor precursor 
(PACAPRIp, ADCYAP1R1, PACR, PACAP 
type 1 receptor) 


NP 001109.1 
Gl:4501923 


LSKSSSQIRMSGLPADNLAT 


NLAT 






Platelet activating factor receptor (PAF-R) 


M80436.1 
GI: 189537 


DTVTEWVPFNQIPGNSLKN 


SLKN 






Probable G protein-coupled receptor GPR32 


AF045764.1 


RAFGEEEFLSSCPRGNAPRE 


APRE 






Probable G protein-coupled receptor GPR35 
(GPCR35) 


AF027957.1 
Gl:2739108 


AVAPRAKAHKSQDSLCVTLA 


VTLA 


X 




Probable G protein-coupled receptor GPR72 
precursor (GPR72pre. GPR72. KIAA1540) 


AF236081.1 
Gl:7248881 


SQLQSGKTDLSSVEPIvTMS 


VTMo 






Prostacyclin receptor (Prostanoid IP receptor, 
PGl receptor, r i oiK, hkikk; 


L29016.1 


SGSAVGTSSKAEASVACSLC 


CSLC 


X 




Prostaglandin D2 receptor (ProstD2R, 
Prostanoid D Preceptor, PGD receptor) 


Q13258 
Gl:2495009 


IRPLRYRSRCSNSTNMESSL 


ESSL 


X 




Prostaglandin E2 receptor, EP1 subtype 
(PE2Rep1, Prostanoid EP1 receptor, PGE 
receptor. EP1 subtype, PE21, PTGER1). 


L22647.1 
Gl:410208 


PSAWEASSLRSSRHSGLSHF 


Lonr 


X 




Prostaglandin E2 receptor, EP2 subtype 
(PE2Rep2, PTGER2, Prostanoid EP2 receptor, 
PGE receptor, EP2subtype). 


U1 9487.1 
Gl:639719 


QDATQTSCSTQSDASKQADL 


QADL 


X 




Prostaglandin E2 receptor, EP3 subtype 
(PE2Rep3, Prostanoid EPS receptor, PGE 

receptor. EP3 subtype) 


U13218.1 
Gl.Oo2745 


STSLPCQCSSTLMWSDHLER 


HLER 






Prostaglandin E2 receptor, EP4 subtype 
(PE2ep4, Prostanoid EP4 receptor, 
PGEreceptor, EP4subtype). 


AAA36434.1 
GI:45249o 


GSSLQVTFPSETLNLSEKCI 


EKCI 


X 




Prostaglandin EP3 receptor (ProsEP3R) 


BAA1 9952.1 
Kji.Zi 14 lyi 


LPLTLASFKLLREPCSVQLS 


VOLS 






Prostaglandin EP3 receptor subtype isoform 
(PEP3isof) 


D86097.1 
Gl:21 02644 


QVPRTWCSSHDREPCSVQLS 


VOLS 






Prostaglandin F2-alpha receptor (PF2aR, 

Prostanoid FP receptor, PGF receptor, PGF2 
alpha receptor. PTGFR) 


AF004021.1 
Gl:2257849 


NSLKVAAISESPVAEKSAST 


SAST 






Proteinase activated receptor 1 precursor (PAR- 
1, Thrombin receptor, Coagulation factor II 
receptor) 


M62424.1 
Gl:339676 


SKMDTCSSNLNNSIYKKLLT 


U'l 1 T 

TSJLLJ 






Proteinase activated receptor 2 precursor (PAR- 
2, Thrombin receptor- like 1, Coagulation factor 
II receptor-like 1) 


Z49993.1 
Gl:1008084 


KHSRKSSSYSSSS 1 I TSY 


KTSY 


X 




Proteinase activated receptor 3 precursor (PAR- 
3, Thrombin receptor- like 2, Coagulation factor 
II receptor-like 2) 


U92971.1 
GI.19oo374 


PFLYFLMSKTRNHSTAYLTK 


YLTK 






Proteinase activated receptor 4 precursor (PAR- 

4, Thrombin receptor- like 3, Coagulation factor 

II receptor-like 3) 


AF080214.1 


SKASAEGGSRGMGTHSSLLQ 


SLLQ 


X 




Putative G protein-coupled receptor 54 
(GPCR54, GPR54) 


/WU01UO0.1 

Gl:14041797 


riQQr^i AAPr^i p\/i npr^MAPi 


NAPL 


X 




Putative G protein-coupled receptor 92 
(GPCR92) 


Gl:9843745 


RP^n^H<?l <?«?FTOnPODSAL 
rsi OL'onoi-ooi i \u.\^t\jiLj\jr^i— 


DSAL 


V 

A 




Putative G protein-coupled receptor GPR44 
(Chemoattraclant receptor- homologous 


ADUUODOO.1 

Gl:4204215 




STSS 






Putative G-Protein coupled receptor, EDG6 
precursor (EDG6pre, Hypothetical protein). 


AJ000479.1 
Gl:3805931 


RSLSFRMREPLSSISSVRSI 


VRSI 


X 




Red-sensitive opsin (Red cone photoreceptor 
pigment) 


Ml 3300.1 
Gl:180696 


SELSSASKTEVSSVSSVSPA 


VSPA 


X 




Retinal G protein coupled receptor 


BC01 1349.1 
Gl:15030185 


VCRGIWQCLSPQKREKDRTK 


DRTK 






Rhodopsin (Opsin2) 


AAC31763.1 
GI* 12361 37 


GDDEASATVSKTETSQVAPA 


VAPA 


X 




Secretin receptor precursor (SecrRpre, SCT- 
R). 


U20178.1 
Gl:662795 


NSTKASHLEQSQGTCRTSII 


TSII 


X 




Serotonin receptor 5-hydroxytryptamine 1A 
(5HT1A, G-21, ser-5-hydroxytryptamine 1A 
receptor) 


M28269.1 

Gl:1 89927 


FNKDFQNAFKKIIKCKFCRQ 


FCRQ 






Serotonin receptor 5-hydroxytryptamine 1B 
(5HT1B, ser-5-hydroxytryptamine 1B rec. 5- 
HT-1D-beta, serotonin 1D beta receptor, 812) 


D10995.1 
Gl:219678 


MSNEDFKQAFHKLIRFKCTS 


KCTS 
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serotonin receptor 5-hydroxytryptannine 1 D 
(5HT1D. serotonin receptor 5-HT-1 D-alpha, 
HTR1D) 


M89955.1 
Gl:177771 


VFNEEFRQAFQKIVPFRKAS 


RKAS 






Serotonin receptor 5-hydroxytryptamine 1 E 
(5HT1E, serotonin receptor 5-HT1E, S31) 


M91467.1 
Gl: 177773 


SFNEDFKLAFKKLIRCREHT 


REHT 






Serotonin receptor 5-hydroxytryptamine IF (5- 
HT-1 F, serotonin receptor 5HT1 F) 


L05597.1 
Gl:307419 


YTIFNEDFKKAFQKLVRCRC 


RCRC 


X 




Serotonin receptor 5-liydroxytryptamine 2A (5- 
HT-2A, serotonin receptor 5HT2A) 


S42168.1 
Gl:252946 


HSEEASKDNSDGVNEKVSCV 


VSCV 


X 




Serotonin receptor 5-hydroxytryptamine 2B (5- 
HT-2B, serotonin receptor 5HT2B) 


X77307.1 
G 1:4751 97 


DTLLLTENEGDKTEEQVSYV 


VSYV 


X 


AA233L 


Serotonin receptor 5-hydroxytryptamine 2C (5- 
HT-2C, serotonin receptor 5HT2C) 


M81778.1 
G 1:338027 


ENLELPVNPSSWSERISSV 


ISSV 


X 


AA205L 


Serotonin receptor 5-hydroxytryptamine 4 (5- 
HT-4, serotonin receptor 5HT4) 


Y1 2505.1 
Gl:2661756 


ESQCHPPATSPLVAAQPSDT 


PSDT 






Serotonin receptor 5-hydroxytryptamine 5A (5- 
ri I -OA, serotonin receptor on i oA) 


X81411.1 
C5I.5417TD 


YTAFNKNYNSAFKNFFSRQH 


SRQH 






Serotonin receptor 5-hydroxytryptamine 6 (5- 
n 1 -D, serotonin receptor on 1 o) 


L41 147.1 
llbzyzvS 


FNIDPAEPELRPHPLGIPTN 


IPTN 






Serotonin receptor 5-hydroxytryptamine 7 (5- 
HT-7, serotonin receptor 5HT7, 5HTX) 


U68488.1 
Gl:1857144 


H N WLAD KM LTTVE KKVIVl 1 H D 


MIHD 






Smoothened homolog precursor (SMOpre, 
SiVIO, Gx protein). 


U84401.1 
Gl;1813875 


PIHSRTNLMDTELMDADSDF 


DSDF 


X 




Somatostatin receptor type 1 (SS1R, SSR1, 
SSTR1 , SRIF-2) 


M81 829.1 
Gl:307433 


NLESGGVFRNGTCTSRITTL 


ITTL 


X 




Somatostatin receptor type 2 (SSR2, SS2R, 
SSTR2, SRIF-1), 


M81 830.1 
Gl:307435 


LNETTETQRTLLNGDLQTSI 


QTSI 


X 


AA113 


Somatostatin receptor type 3 (SSR3, SS3R, 
SSR-28) 


M96738.1 
Gl:338498 


LLPQEASTGEKSSTMRISYL 


ISYL 


X 




Somatostatin receptor type 4 (SS4R, SSTR4) 


D16826.1 
Gl:693907 


EALQPEPGRKRIPLTRTTTF 


TTTF 


X 


AA24d 


Somatostatin receptor type 5 (SS5R, SSTR5) 


AAK61266.1 
Gl: 14336736 


EATPPAHRAAANGLMQTSKL 


TSKL 


X 




Sphingoslnel -phosphate receptor Edg-8 (SPPR) 


AF31 7676.1 
Gl: 11559845 


TGSPGAPTAARTLVSEPAAD 


PAAD 






Substance-K receptor (NKinin2R, SKR, 

INcUl UMFlIll M icC^piUi, INI\-/ir\y 


M57414.1 

r^l-i QQ-1 riA 

oi. 1 ozf \ -y-t 


GSG LWFG YG LLAPTKTH VE 1 


HVEI 


X 




Substance-P receptor (SPR, NK-1 receptor, 


S62045.1 


SRSDSKTMTESFSFSSNVLS 


NVLS 






Thromboxane A2 receptor (TBXA2R, TXA2-R, 
rTosianoiu i " reoepior^ 


U11271.1 


ASRVQAILVPQPPEQLGLQA 


GLQA 




• 


Thyrotropin receptor precursor (TSH-R, 
1 f lyiUlU siiiTiuiauny nuiriiune leocpiory 


M73747.1 

OI.9U0/09 


SHLTPKKQGQISEEYMQTVL 


QTVL 


X 




Thyrotropin-releasing hormone receptor (TRH- 
lA, 1 nyioiiDciin icucpiui ^ 


D16845.1 

OI.«7r / OO 1 


ATKVSFDDTCLASEVSFSQS 


FSQS 






Trace amine receptor 1 (AmineRI ) 


AF380185.1 


FGKIFQKDSSRCKLFLELSS 


ELSS 






Trace amine receptor 3 (AmineR3) 


AF380189.1 
Gl:14600081 


KVLRTDSSTTNLFSEEVETD 


VETO 






Trace amine receptor 4 (AmineR4) 


AF380192.1 


VTGQVLKNSSATIVINLFSEHI 


SEHI 


X 




Trace amine receptor 5 (TA5, GPR102) 


AF411116.1 
Gl: 16566343 


LILSGDVLKASSSTISLFLE 


LFLE 






Urotensin il receptor (UR-II-R) 


AF1 40631.1 
Gl:5902615 


LVLAPAAPARPAPEGPRAPA 


RAPA 


X 




Vasoactive intestinal polypeptide receptor 1 
precursor (VIPR1 , Pituitary adenylate cyclase 
activating polypeptide type II receptor, PACAP 
type il receptor, PACAPR2) 


U1 1087.1 
GI:508258 


TRVSPGARRSSSFQAEVSLV 


VSLV 


X 




Vasoactive intestinal polypeptide receptor 2 
precursor (VIPR2pre, VIP-R-2, Pituitary 
adenylate cyclase activating polypeptide type III 
receptor, PACAP type III receptor, PACAP-R-3, 
Helodennin-preferring VIP receptor) 


L40764.1 
Gl:712836 


LQFHRGSRAQSFLQTETSVI 


TSVI 


X 


AA329 


Vasopressin receptor type 2 {VasoprR2) 


AF032388.1 
Gl:2654030 


VQLWAAWDPEAPLEGGCSRG 


CSRG 






Vasopressin Via receptor (V1aR. 
Vascular/hepatic-type arginine vasopressin 
receptor, Antidiuretic homione receptor 1a, 
AVPRVIa) 


AAA62271.1 
Gl:667068 


GMWKDSPKSSKSIKFIPVST 


PVST 
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Vasopressin V1 b receptor (VI bR, AVPR V1 b, 
Vasopressin V3 receptor, AVPR V3, Antidiuretic 

hnrrnrtnp r^f^ntor 1hN 

IIUIIIIV^IIC tsS\AS\J\\Jf 'VJ 


D31 833.1 
Gl:563981 


ESPRDLELADGEGTAETIIF 


TIIF 


X 




Vasopressin V2 receptor (Renal-type arginine 
vasopressin receptor, Antidiuretic honmone 
receptor, AVPR V2) 


U04357.1 
Gl:30b4498 


GPQDESCTTASSSLAKDTSS 


DISS 






Vomeronasal receptor 1 (VonnNasRI , Putative 
pheromone receptor VI RL1 long form, 
VNR19I1, V1RL1). 


AF302903.1 
Gl:10732801 


QFCFACRTRKTLFPNLWMP 


WMP 






Y6 encoding protein (Y6 protein) 


D86519.1 
01:1731789 


GACWLPRISSMSSLTGIMRC 


IMRC 


X 
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Table 4. 



r^POR nana 


ntsi i rt i n n 
~L/4L-UUIIIclll III 

gene 


Domain(s) 


alphal A-Adrenergic receptor 


nNOS 




beta2-Adrenergic receptor (DSLL) 


EBP 50 


1 


beta2-Adrenergic receptor (DSLL) 


SIP1 


1 


P2Y1 purinergic receptor (DTSL) 


EBP50 


1 


GRK6A (TRL) 


EBP50 


1 


beta1 -Adrenergic receptor (DSLL) 


rat PSD95 


3 


parathyroid hormone 1 receptor 


SIP1 


2 


parathyroid hormone 1 receptor 


EBP50 


na 


5HT2B 


CNOS 




platelet-derived growth factor receptor 


EBP50 




mGLURIa 


shank 




mGLUR5 


shank 




SSTR2 


shank 1 




SSTR2 


shank2 




IL8RB 


RGS12 




CL1 (a-latrotoxin) 


shank 




5HT2B 


InadI 


6 


B1AR 


MAGI2 


1 


rat SSTR2 


CBP1 




5HT2C 


MUPP1 


10 


SSTR2A 


CBP1 




CIRL1 


shank2 




CIRL2 


shank2 




CIRL1&2 


shank family 




prolactin-releasing peptide receptor 


GRIP 




prolactin-releasing peptide receptor 


ABP 




prolactin-releasing peptide receptor 


PICK1 




kappa opioid receptor 


EBP50 


1 


mGLUR7 


PICK1 
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Table 6. 



Gene Name 


Gl or Acc* 


Domain 
# 


Sequence fused to GST Construct 


26s subunit 
P27 


918438S 


1 


RDMAEAHKEAMSRKLGQSESQGPPRAFAKVNSISPGSPSIAGLQVDDBVEFGS 
VNTQNFQSLHNIGSWQHSEGALAPTiaSVSM 


AF6 


43099C 


1 


LRKEPEIITVTI-KKQNGMGLSIVAAKGAGQDKLGIYVKSNA/KGGAADVDGRIJVA 
GDQLLSVDGRSLVGLSQERAAELMTRTSSWTLEVAKQG 


AlPC 


1275145^ 


1 


URPSVISIIGLYKEKGKGLGFSIAGGRIXJIRGQMGIFWTIFPNGSAAEDGRLKEGDE 
ILDVNGIPIKGLTFQEAIHTFKQIRSGLFVLTVRTKLVSPSLTNSS 


AlPC 


12751451 


2 


GISSLGRKTPGPKDRIVMEVRJVlKEPRVGLGIGACCLALENSPPGmHSUVP^ 

AKMESNLSRGDQILEVNSVNVRHAALSKVHAILSKCPPGPVRLVIGRHPNPKV 

SEQEMDEVIARSTYQESKEANSS 


AlPC 


12751451 


3 


QSENEEDVCFIVLNRKEGSGLGFSVAGGTDVEPKSITVHRVFSQGAASQEGTMN 
RGDFLLSVNGASLAGU\HGNVLKVLHQAQLHKDALWIKKGMDQPRPSNSS 


AlPC 


12751451 


4 


LGRSVAVHDALCVEVLKTSAGLGLSLDGGKSSVTGDGPLVIKRVYKGGAAEQA 
GIIEAGDEILAINGKPLVGLMHFDAWNIMKSVPEGPVQLLIRKHRNSS 


alpha actinin-2 
associated LIM 
protein 


277305£ 


1 


QTVILPGPAAWGFRLSGGIDFNQPLVITRITPGSKAAAANLCPGDVILAIEXaFGTE 
SMTHADGQDRIKAAEFIV 


APXL-1 


1365126G 


1 


ILVEVQLSGGAPWGFTLKGGREHGEPLVIIKIEEGSKAAAVDKLLAGDEIVGINDI 
GLSGFRQEAICLVKGSHKTIKLWKRNSS 


Atrophin-1 
Interacting 
Protein 


2947231 


1 


REKPLFmDASQLKGTFLSTTlKKSNMGFGFTIIGGDEPDEFLQVKSVIPEX3PAAQ 
DGKMETGDVIVYINEVCVLGHTHADWKLFQSVPIGQSVNLVLCRGYP 


Atrophin-1 
Interacting 
Protein 


2947231 


2 


LSGATQAELMrLTIVKGACK3FGFTIADSPTGQRVKQILDIQGCPGLCEGDUVEIN 
QQNVQNLSHTEWDILKDCPIGSErSUIHRGGFF 


Atrophin-1 
Interacting 
Protein 


2947231 


3 


HYKELDVHLRRMESGFGFRILGGDEPGQPILIGAVIAMGSADRDGRLHPGDELVY 
VDGIPVAGKTHRYVIDLMHHAARNGQVNLTVRRKVLCG 


Atrophln-1 
Interacting 
Protein 


2947231 


4 


EGRGISSHSLQTSDAVIHRKENEGFGFVIISSLNRPESGSTITVPHKIGRIICXSSPAD 
RCAKLKVGDRILAVNGQSIINMPHADIVKLIKDAGLSVTLRIIPQEEL 


Atrophin-1 
Interacting 
Protein 


2947231 


5 


LSDYRQPQDFDYFT/DMEKGAKGFGFSIRGGREYKMDLYVLRLAEDGPAIRNGR 
MRVGTOIIHNGESTODIVrmARAIEUKSQGRRVRLlI^ 


Atrophin-1 
Interacting 
Protein 


2947231 


6 


HESVIGRNPEGQLGFELKGGAENGQFPYLGEVKPGKVAYESGSKLVSEELLLEV 
NETPVAGLTIRDVLAVIKHCKDPLRLKCVKQGGIHR 


CARD11 


12382772 




NLMFRKFSLERPFRPSNn-SVGHVRGPGPSVQKTTUvJGDSLTSQLTLLGGNARG 

SFVHSVKPGSLAEKAGLREGHQLLLLEGCIRGERQSVPLDTCTKEEAHWnQRC 

SGPVTLHYKVNHEGYRKLV 


CARD14 


1312912C 




ILSQVTMIJ\FQGDALLEQISVIGGNLTGIFIHRVTPGSAADQMALRPGTQIVMVD 
YEASEPLFKAVLED1TLEEAVGLLRRVDGFCCLSVKVNTDGYKRL 


CASK 


3087815 




TOVRLVQFQKNTDEPMGITU<MNEIJvJHCIVARIMHGGMIHRCX3U>HV^ 
GISVANQTVEQLQKMLREMRGSITFKIVPSYRTQS 


Connector 
Enhancer 


393078C 




LEQKAVLEQVQLDSPLGLEIHTTSNCQHFVSQVDTQVPTDSRLQIQPGDEW 

QINEQVWGWPRKNMVRELLREPAGLSLVLKKIPIP 


Cytohesin 

Binding 

Protein 


319290€ 




QRKLVTVEKQDNETFGFEIQSYRPQNQNACSSEMFTLICKIQEDSPAHCAGLQA 
GDVLANINGVSTEGF7YKQWDLIRSSGNLLTIETLNG 


Densin 180 


1675589S 




RCUQTKGQRSMIXiYPEQFCVRIEKNPGLGFSISGGISGCX3hPFKPSC^ 
PDGPASNLLQPGDKILQANGHSFVHMEHEKAVLLLKSFQNTVDLVIQRELTV 


DLG1 


47581 ^ 




IQVNGTDADYEYEEIUERGNSGLGFSIAGGTDNPHIGDDSSIFrrXIITGGAAAQD 
GRLRVNDCILQVNEVDVRDVTHSKAVEALKEAGSIVRLYVKRRN 


DLG1 


475816 


2 


IQLIKGPKGLGFSIAGGVGNQHIPGDNSIYVTKIIEGGAAHKDGKLQIGDKLLAVNN 
VCLEEVTHEEAVTALKNTSDFVYLKVAKPTSMYMNDGN 



153 



DLG1 


475816 


3 


11 HRf^STf^l GFNIVAf^FrK^FfilFI^FIl ARRPAD! Sf5FLRKGnRII^\/NWni RAA^i 
HEQAAAALKNAGQAVTIVAQYRPEEYSR 


DLG2 




-| 


ISYVNGTFIFYFFFEm FRGNSGL GFSIAGGTDNPHIGDDPGIFrn<IIF>ftttAAAFnf^ 
RLRVNCX;iLRVNEVDVSEVSHSKAVEALKEAGSIVRLYVRRR 


DLG2 




2 


I^WFIKL FKGPKGLGFSIAGGVGNOHIPGDNSI YVTKII DGOAAOKrY^Rl OVftnRI 
LMVNNYSLEEVTHEEAVAILKNTSEWYLKVGNPTTI 


DLG2 


12736552 


3 


IWAVSLEGEPRKWLHKGSTGLGFNIVGGEDGEGIFVSFIU\GGPADLSGELQRG 
DOILSVNGIDLRGASHEOAAAALKGAGOTVTIIAOYOPFD 


DLG5 


3650451 




GIPYVEEPRHVKVQKGSEPLGISIVSGEKGGIYVSKVm/GSIAHQAGLEYGDQLLE 
FMf^lM R^ATFOOARI lir^OOnnTITIl AOYIMPHVHOI RM^^7I TD 


DLG5 


3650451 




GILAGDANKKTLEPRWFIKKSQLELGVHLCGGNLHGVFVAEVEDDSPAKGPD 
f^l \/Pf5ni II FYf^^l DVRNKTVFEVYVFMLKPRDGVRLKVOYRPFFFIVm 

VJIL. V 1 VJIL^LIIwL^ 1 VJI wL.I_/ V III ^1 X 1 V[_[_V 1 V ^IVI^IXl lil^VJI V ll^l x V Vn 1 ill ^E^l^l V 1 1_/ 


DLG6, splice 

VCIIICIIIl 1 


1464714C 




PTSPEIQELRQMLQAPHFKALLSAHDTIAQKDFEPLLPPLPDNIPESEEAIVIRIVC 

1 VKKiOOPL GATIKRHFMTGniL VARI IHGGLAFRSGIJ-YAGDKLVFVN 
L.vr\i^VrfV«(i LAj^ 1 iiMin^ivi 1 \jiL/i^ vr>riiiriwiv^L.^^^rio\jii— L. i n\jiLyr\^v&vi>i\jivovc.\Ji 

LDPEQVIHiLAMSRGTIMFKWPVSDPPVNSS 


variant 2 






PT<=5PFIOFI ROMl OAPHFKGAT1KRHFMTGDII VARIIHGGLAFR^^Gl 1 YAf^HKI 
VEVNGVSVEGLDPEQVIHILAMSRGTIMFKWPVSDPPVNSS 


DVL1 


2291005 




LNI\/T\/TLNM ERHH FLGISl VGOSNDRGDGGI YIGSIMKGGAVAADG RIEPGDMLL 

l_l>IIV 1 V 1 UNIVItill 11 II l_\Jllwl VV3Vj(OINLyil>JILa/VJIVJI I Iw^MVirxVJiVJI^ VrV^L^V^rilC^i V3liyiVil_^ 

QVNDVNFENMSNDDAVRVLREIVSQTGPISLTVAKGW 


n\/l P 


1 \J\J f 


^ 


1 Nirrvn NMFKYNFI nRIV/fjO^NFRf^rVnniYin^lMKt^fiAVAAnfiRIFPRDMI 1 o 
VNDI^NFENMSNDDAVRVLRDIVHKPGPIVLTVAKCWDPSPQNS 


Ly V 






irrvn NMFKYNFI fil^^lVf^O^NFRnrx^f^lYltt^lMKOttAVAADGRIFPGDMI 1 OVN 

II 1 V 1 UNIVI^rx 1 INI L.VJIIOI V VJIVi«(OI>l^livJILyVJIVJII T iVJOIiVirxVJVJir^ VfVALyVjril^r \JLy|VII,.t.Vi< VIN 

EINFENIVISNDDAVRVLREIVHKPGPITLTVAKCWDPSP 


Fl FIN 1 


9QR71 4Z 




TTOOini OnPRPWf^FRI VfiRKDFFOPI AISRVTPR^KAALANLniRnX/ITAIfy^F 
NTSNMTHLEAQNRIKGCTDNLTLTVARSEHKVWSPLV 




OD 1 QOv. 




IFIWin*^FK\A/l Ff^PAPWf^FRl Or^f^KnFMVPi *^I*^RI TPnr^KAAOAf^VAX/nDWVI 
SIDGENAGSLTHIEAQNKIRACGERLSLGLSRAQPV 








rV^WFI Al^riFIH\/R\/FKnPFi r5F<5IQfV^\/<'^Rr5MPFRPnnrY^IF\/TR\/OPFr5PAQK 

LLQPGDKIIQANGYSFINIEHGQAVSLLKTFQNTVELIIVREVSS 


P7RiM RinHinn 

Protein 50 


O^^UU 1 c 




II PPI FKriPMf^YnFHI Hf^FKHKI f^OYIRl \/FPr5<^PAFKA<^l 1 Af^DRI \/F\/Nf5FN 

VEKETHQQVVSRIRAALNAVRLLVVDPEFIVTD 


F7RiNI RInHinn 

Protein 50 


O^^UU 1 c 


2 


IRI PTTWIKKf^P^nYnFNI H^nK*5KP<^OFIRWnPn*^PAFA^f5l RAOnRI\/FVNn\/ 
CMEGKQHGDWSAIRAGGDE7KLLWDRErDEFFI\/lNSS 


Fl innon 






KNPC^r^FI KTVn ^KMKO^I nRI<^r^lF^KVOPM\/KIFKIFF*nnAAR ^HAI OAQFF 
rxiNi ouid_rx 1 V 1 LorxivirxvxoL.vjiioiovjiVJii^orxv v^ir ivi v ixitzixJi i vjva/A/Ai l.\Jvji/ai_Vj</av3i t 

LVAVDGENLEQVTHQRAVDTIRRAYRNKAREPMELWRVPGPSPRPSPSD 


Fl IHPI*^ 

1 l_«J 1 1 ^ 1 o 






F(^H<^HPR\A/FI PKTFFf^l r5FNIMf5r5KFON9PIYRRIIPfTniAnRHr^(^l KRHDOI L5? 
Cvjnoni nv vclj rx i c^vjii^vjii iNiivivjivjirx^v^iNOi 1 1 toriiir vJvjiifALyrii i^jvjlj \rivj3L./xxL-i_o 

VNGVSVEGEHHEKAVELLKAAQGKVKLWRYTPKVLEEME 


Fl I1P4PR 






Pn A P Y AR KTFTI DA Vf^ WG FWRn PCH lO AVnPSG PAAA AGM KVOOFNA/ 
SVNGLNVLHVDYRTVSNLILTGPRTIVMEVMEELEC 




104'?4Pnc 




nOYfifiFTVKIVRIFKARDIPI GATVRNFMD^VIISRIVKGGAAFKSGLLHEGDEVL 
EINGIEIRGKDVNEVFDLLSDMHGTLTFVLIPSQQIKPPPA 


Fl IPnOTR 


701 00*^? 




II AHVKGIFKFVNWK^Fn^^l Gl TITnNnVGYAFIKRIKDGGVIDSVKTinVGnHIF 
iL-rAri vrxVJiicrxOf IN v I rxo^LJOLVJiL 1 1 1 i_/iN\ji v\ji I /Arir\riirxL./vjivj v iLyovrx i vx^L^ni^ 

SiNGENIVGWRHYDVAKKLKELKKEELFTMKLIEPKKAFEl 


Fl l91Rfl7 






KP<50A^nHFWFI VRnVAOFHI Tl r^GnRnX/AGnTR AVRRLL KHRPAORfiGRL 
EVGDLVLHINGESTQGLTHAQAVERIRAGGPQLHLVIRRPLETHPGKPRGV 


FLJ31349 


AK05591 1 


1 


PVMSQCACLEEVHLPNIKPGEGLGMYIKSTYDGLHVITGTTENSPADRSQKIHAG 
nFVION/IMOnTWfiWni kni v/kki rfmptgwi i i kkrptg^fnftpffivtd 

L/C V IVjcVINV«{Vj( 1 V VV3 VVWL-lAlNLVr\r\L.riCINr 1 \JI V V L.I-1-ixrxn 1 1 VJOriNi 1 r I_i 1 V 1 u/ 


FLJ32798 


AK05736C 




LDDEEDSVKIIRLVKNREPLGATIKKDEQTGAIIVARIMRGGAADRSGLIHVGDEL 
RFVMGIPVFnKRPFFIIOII AO^OGAITFKIIPG^IKFFTPSNSS 


GRIP1 


4539083 




WElJVIKKEGTTLGLTVSGGIDKDGKPRVSNLRQGGIAARSCXaLDVGDYIKAVNG 
INLAKFRHDEIISLLKNVGERWLEVEYE 


GRIP1 


453908C 


2 


RSSVIFRTy/EVnJHKEGMTFGFVIRGGAHDDRNKSRPWITGVRPGGPADREGTl 

KPGDRLLSVDGIRLLGTTHAEAMSILKQCGQEAALLIEYDVSVMDSVATASGN 
SS 


GRIP1 


453908C 


3 


HVATASGPLLVEVAKTPGASLGVALTTSfVlCCNKQVlVIDKIKSASIADRCGALH 
VGDHILSIDGTSMEYCTLAEATQFLANTTDQVKLEILPHHQTRLALKGPNSS 


GRIP1 


4539083 


4 


TETTEWLTADPVTGFGIQLQGSVFATETLSSPPLISYIEADSPAERCGVLQIGD 
RVMAINGIPTEDSTFEEASQLLRDSSITSKVTLEIEFDVAES 



154 



GRIP1 


453908G 


5 


AESVIPSSGTFHVKLPKKHNVELGmSSPSSRKPGDPLVISDIKKGSVAHRTGTL 

FLGDKL LAIDNIRL nNCSMFDAVOILOOCFDIVKLKIRKnEDNSn 


GRIP1 


4539082 


6 


lYTVELKRYGGPLGmSGTEEPFDPIIISSLTKGGLAERTGAIHIGDRILAINSSSLKG 
KPLSEAIHLLQMAGETVTLKIKKQTDAQSA 


onln 1 




/ 


IMQDTDV/n Ull^/n Vl^ncnMPnC/SPCV/ArV^I l Pk'^VAA/l^KIIDDA^DOni /^/^l l^D 
llVIOn 1 rVCLnixV 1 LTl\lJoUlvlClJr^roVMIJl3lJ-tt\V3VT VININI^ 

YDRLLQVNHVRTRDFDCCLWPLIAESGNKLDLVISRNPLA 


Activating 

Pn7\/nn^ 

^1 i^jr 1 lie 




1 


QTLPSLRPEAAAQLLRSAPKVCVTVLPPDESGRP 


Guanine 
Factor 


665076£ 


1 


AKAKWRQNA/LQKASRESPLQFSLNGGSEKGFGIFVEGVEPGSKAADSGLKRGD 
OIMEVNGONFENITFMKAVFILRNNTHLALTVKTNIFVFKFL 


HFMBA 
1000505 


1 0436367 

1 \J ^ kJ KJ \J § 


"1 


LENVI AKSLL IKSNEGSYGFGLEDKNKVPI IKLVEKGSNAEM AGM EVGKKI FAING 
DLVFMRPFNEVDCFLKSCLNSRKPLRVLVSTKP 


1000505 


1 04*^6*^67 


o 


PRETV/KIPn^ADGI GFOIRGFGP^WHAVGRGTVAAAAGI HPGOfillKVNRINX/*? 
KETHASVIAHVTACRKYRRPTKQDSIQ 


HEMBA 
1003117 


7022001 


-J 


EDFCYVFT\/ELERGPSGLGMGLIDGMIHTHLGAPGLYIOTLi_PGSPAAAnGRLSL 
GDRILEVNGSSLLGLGYLRAVDLIRHGGKKMRFLVAKSDVETAKKI 


HTRA3 


AY04OOQ^ 


-l 


L TFFOnKOIKnWKKRFIGIRMRTITPfiLVnELKASNPnEPEVSSGIYVOFX/APN^ 
PSQRGGIQDGDIIVKVNGRPLVDSSELQEAVLTESPLLLEVRRGNDDLLFSNSS 


HTRA4 


AL57644^ 


1 


HKKYLGLQMLSLTVPLSEELKMHYPDFPDVSSGWVCKWEGTAAQSSGLRD 
HnVIVNINnKPITTTTnWKAl D^D^l *5MA\/I RGKDNI 1 1 TVISJ9^ 

riL/ V 1 V iNllNVJirxr 1 1 1 1 1 1^ V V lx/AL.I^01^0L0IVI/^ V LriV^rvLylNLLL 1 VINOO 


INADL 


2370146 


1 


IWQIEYIDIERPSTGGLGFSWALRSQNLGKVDIFVKDVQPGSVADRDQRLKEN 
DOILAINHTPLDONI^HOOAIALLOOTTGSLRLIVAREPVHTKSSTSS^iE 


INADL 


23701 4€ 


2 


PGHVEEVEUNDGSGLGFGIVGGKTSGVNA/RTIVPGGLADRDGRLQTGDHI^ 
TNVOGI^ATSFOVAOVI RNGGN^^ 


INADL 


2370146 


3 


PGSDSSLFETYNVELVRKDGQSLGIRIVGYVGTSHTGEASGIYVKSIIPGSAAYHN 
GHIQVNDKIVAVDGVNIQGFANHDWEVLRNAGQVVHLTLVRRKTSSSTSRIH 
RD 


INADL 


2370146 


4 


NSDDAELQKYSKLLPIHTLRLGVEVDSFDGHHYISSIVSGGPVDTLGLLQPEDE 
LLEVNGMOLYGKSRREAVSELKEVPPPFTLVCCRRLFDDEAS 


INADL 


2370146 


5 


LSSPEVKIVELVKDCKGLGFSILDYQDPLDPTRSVIVIRSLVADGVAERSGGLLP 
G DRLVSVN EYCLDNTSLAE AVEI LKAVPPGLVH LG 1 CKPLVEFIVTD 


INADL 


2370146 


6 


PNFSHWGPPRIVEIFREPNVSLGISIWGQTVIKRLKNGEELKGIFIKQVLEDSPAG 
KTNALKTGDKILEVSGVDLONASHSEAVEAIKNAGNPWFIVOSLSSTPRVIPN 
VHNKANSS 


INADL 


pOTn-l AC 


7 


PnFI HIIFI FKDKNfil GA AfnMKnR^RM<^IF\A/n!NPF(nPAAAnGRMRIf5nFL I F 
INNQILYGRSHQNASAIIKTAPSKVKLVFIRNEDAVNQMANSS 


INADL 




3 


PATCPIVI^OEMIIFRKGR^LGLSIVGGKnTPLNAIVIHEVYEEGAAARnGRLW 
% 1 1 vi^\3\j(wiviii^iwr\v3iiOV3i_V3i_oi vvjvjiALy 1 1 u>ir^ Vint V i i^tv^irwAriL/vjiritvv 

AGDQILEVNGVDLRNSSHEEAITALRQTPQKVRLVVY 


KIAA0147 


1 46987£ 




IL TLTILROTGGLGISIAGGKGSTPYKGDDEGIFISRVSEEGPAARAGVRVGDKLLE 
VNGVALQGAEHHEAVEALRGAGTAVQMRVWRERMVEPENAEFIVTD 


KIAAniil7 


1 46987£ 


2 


PLRORHVACLARSFRfnl fiF^nlAfnttKn^TPYRAGDAGIFVSRIAEGGAAHRAGTLO 
VGDRVLSINGVDVTEARHDHAVSLLTAASPTIALLLEREAGG 


KIAAni47 


1469875 


3 


ILEGPYPVEEIRLPRAttttPI ttl SIX/GfiSDHJiSHPFGVOEPGVnSKX/LPRGLAARS 
GLRVGDRILAVNGQDVRDATHQEAVSALLRPCLELSLLVRRDPAEFIVTD 


KIAA0147 
rMMrW If/ 




4 


RFI PIOKAPf^FRl nRIR(^ARnHA(^NIPRnPTnFrilFI<y<V^PTGAAGRr)r5RI RVG 
LRLLEVNQQSLLGLTHGEAVQLLRSVGDTLTVLVCDGFEASTDAALEVS 




99'>4<\4f 




PHOPIViHQj^r^KNYf^FTIRAIRX/YVf^n^niYTVI-IHIVWNX/FFn^PAnOAni KAt^D 

UTHINGEPVHGLVHTEVIELLLKSGNKVSITTTPF 


KIAA0313 


765726C 




ILACAAKAKRRLIVrTLTKPSREAPLPFILLGGSEKG FGI FVDSVDSGSKATEAGLKR 
GDQILEVNGQNFENIQLSKAMEILRNNTHLSITVKTNLFVFKELLTNSS 


KIAA0316 


668312c 




IPPAPRKVEMRRDPVLGFGFVAGSEKPVWRSVTPGGPSEGKLIPGDQIVMINDE 
PVSAAPRERVIDLVRSCKESILLTVIQPYPSPK 


KIAA0340 


222462C 




LNKRTITVIPKDSGALLGLKVVGGKMTDLGRLGAFITKVKKGSLADWGHLRAGD 
EVLEWNGKPLPGATNEEVYNIILESKSEPQVEIIVSRPIGDIPRIHRD 


KIAA0380 


222470C 




QRCVIIQKDQHGFGF7VSGDRIVLVQSVRPGGAAMKAGVKEGDRIIKVNGTMVT 
NSSHLEVVKLIKSGAYVALTLLGSS 



155 



KIAA0382 


7662087 




ILVQRCVIIQKDDNGFGLTVSGDNPVFVQSVKECX3AAMRAGVQTGDRIIKVNGT 
LVTHSNHLEVVKLIKSGSYVALTVQGRPPGNSS 


KIAA0440 


26621 6C 




SVEMTLRRNGLGQLGFHVNYEGIVADVEPYGYAWQAGLRQGSRLVEICKVAV 
ATLSHEQMIDLLRTSVTVKVVIIPPHD 


KIAA0545 


1 476285C 




UCV^^^SGWETVD^/r^LRRNGLGQLGFHVKYDGTVAEVEDYGFAW^ 
RLVEICKVANATTLTHDQMIDLLRTSVTVKWIIPPFEDGTPRRGW 


KIAA0559 


3043641 




HYIFPHARIKITRDSKDHTVSGNGLGIRIVGGKEIPGHSGEIGAYIAKILPGGSAEOT 
GKLMEGMQVLEWNGIPLTSKTYEEVQSIISQQSGEAEICVRLDLNML 


KIAA0561 


304364£ 




LCGSIJIPPIVIHSSGKKYGFSIJRAIRWMGDSDVYTVHHVVWSVEDGSPAQEA 
GLRAGDLITHINGESVLGLVHMDWELLLKSGNKISLRTTALENTSIKVG 


KIAA0613 


332703S 




SYSVTLTGPGPWGFRLCK3GKDFNMPLTISRITPGSKAAQSQLSQGDLVVAIDGV 
NTDTMTHLEAQNKIKSASYNLSLTLQKSKNSS 


KIAA0751 


1273416£ 




ISRDSGAMLGLKWGGKMTESGRLCAFITKVKKGSLADTVGHLRPGDEVLEWN 
GRLLQGATFEEVYNIILESKPEPQVELWSRPIAIHRD 


KIAA0807 


388233^ 

KJF \J Mmm W 1 




ISALGSMRPPIIIHRAGKKYGFTIJ^AIRWMGDSDVYWHHMVWHVEDGGPASE 

i ^^#^^X,i< wit II 1 II If II If^V^I XI X i ^^1 1 ^9 l^^li 1 V 1 1 VI\^l_/v.7L^ V i 1 W| 11 IIVIVVVl IV ^L^v-4X^i 

AGLRQGDLITHVNGEPVHGLVHTEWELILKSGNKVAISTTPLENSS 


KIAA0858 


424020^ 




FSDMRISINOTPGKSLDFGFTIKWDIPGIFVASVEAGSPAEFSOLOVDDFIIAINNT 
KFSYNDSKEWEEAMAKAQETGHLVMDVRRYGKAGSPE 


KIAA0902 


4240295 




QSAHLEVIQLANIKPSEGLGMYIKSTYDGLHVITGTTENSPADRCKKIHAGDEVI 

XJtx^f^l I^^M ▼ iXfCk^^l ^11 Xi V^^hVi^^Vp^IIpI 1 11X^1^ i 1 fc^V^^i IV 1 1 Xii4 1 1 ^hI V^i^l ^Xb^l l^^lxlxll Ir^X^L^L^Vi 

QVNHQTWGWQLKNLVNALREDPSGVILTLKKRPQSMLTSAPA 


KIAA0967 


4589577 




ILTQTLIPVRHTVKIDKDTLLQDYGFHISESLPLTWAVTAGGSAHGKLFPGDQI 
LQMNNEPAEDLSWERAVDILREAEDSLSITWRCTSGVPKSSNSS 


K1AA0973 


458958S 




GLRSPITIQRSGKKYGFTLRAIRVYMGDTDVYSVHHIVWHVEEGGPAQEAGLCA 

X <l 1 IXi'l 1 1 IXiJCI 1^^V«4I Xi X i V^l 1 Wl l^^ll IV 1 iVIXi<fc^ 1 V 1 VI II IIVVVI IV twivw^^X^I ^^^C^^^VJI^V^r^ 

GDLITHVNGEPVHGMVHPEWELILKSGNKVAVTTTPFE 


KIAA1095 


5889526 




QGEETKSLTLVLHRDSGSLGFNIIGGRPSVDNHDGSSSEGIFVSKIVDSGPAAKE 
GGLQIHDRIIEVNGRDLSRATHDQAVEAFKTAKEPIWQVLRRTPRTKMFTP 


KIAA1095 


5889526 




QEMDREELELEEVDLYRMNSQDKLGLTVCYRTDDEDDIGIYISEIDPNSIAAKDG 
RIREGDRIIQINGIEVQNREEAVALLTSEENKNFSLLIARPELQLD 


KIAA1202 


6330421 




RSFQYVPVQLCXSGAPWGFTIKGGLEHCEPLTVSKIETCGKAAI^KMOT^ 
VNINGTPLYGSRQEALILIKGSFRILKLIVRRRNAPVS 

V 1 ^11 II 1 v^\^ 1 IVjCI— /«L»IL»tl XN^S^I 1 llli^l Xbl V 1 11 II 11 NrXI V \J 


KIAA1222 


63306 1C 




ILEKLELFPVELEKDEIXpLGISIIGMGVGADAGLEKLGIFVKTVTEGGAAQRDGRIQ 
vndoivevdgislvgvtonfaatvlrntkgnvrfvigrekpgovs 

V 1 VLyVj<l V V L«/V^I««J^ V X^ V 1 ^1 r^r^ 1 V ^1 il ^ 1 1 XViJI ^ V 1 11 V IV^I I&mI XI \^^( V \ J 


KIAA1284 


6331 36£ 




KDVNNA^NPKKLTVIKAKEQLKLLEVLVGIIHQTKWSWRRTGKQGDGERLVVH 

GLLPGGSAMKSGOVLIGDVLVAVNDVDVTTENIERVLSCIPGPMOVKLTFENA 

YDVKRET 


KIAA1389 


72431 5£ 




TTRGCETOEMTLRRNGLGOLGFHVNFEGIVADVEPFGFAWKAGLROGSRLVEICK 
VAVATLTHEQMIDLLRTSVTVKVVIIQPHDDGSPRR 


KIAA1415 


724321 C 




VFNII AKRl 1 II POFFDYGFDIFFKNKAWVKWORfi^LAFVAfil 0\/RRKiy*=5IM 
EDLVFLRPFSEVESILNQSFCSRRPLRLLVATKAKEIIKIP 


KIAA1526 


5817166 




PDSAGPGEVRLVSLRRAKAHEGLGFSIRGGSEHGVGIYVSLVEPGSLAEKEGLR 
VGDQILRVNDKSLARVTHAEAVKALKGSKKLVLSVYSAGRIPGGYVTNH 


KIAA1526 


5817166 


2 


L(>3GDEKKVNLVLG[X3RSLGLTlRGGAEYGLGIYrrGVDPGSEAEGSGLKVGDQI 
LEVNWRSFLNILHDEAVRLLKSSRHLILTVKDVGRLPHARTTVDE 


KIAA1526 


581 7166 


3 


WTSGAHVHSGPCEEKCGHPGHROPLPRIVTIORGGSAHISICGOLKVGI-IVILEVN 
GLTLRGKEHREAARIIAEAFKTKDRDYIDFLDSL 


KIAA1620 


10047316 


1 


ELRRAELVEIIVETEAQTGVSGINVAGGGKEGIFVRELREDSPAARSLSLQEGCX) 
LLSARVFFENFKYEDALRLLQCAEPYKVSFCLKRTVPTGDLALRP 

i ,%^\Jf\M 1 V 1 1 L^t ^1 1 X 1 %^m^r^t^l Ib^lit.^XMX^'^k^l 1 I X V \^l X^/^l Xi 1 1 V i 1 1 1 


KIAA1634 


10047344 


1 


PSQLKGVLVRASLKKSTMGFGFTIIGGDRPDEFLQVKNVLKDGPAAQDGKIAPG 
DVIVniNinNPVI nHTHAnWOMFOI VPVNOYVNI Tl CRGYPL PDD*^FD 


KIAA1634 


10047344 


2 


ASSGSSQPELNHIPLIKGPKGFGFAIADSPTGQKVKMILDSQWCQGLQKGDIIKEIY 
HQNVQNLTHLQVVEVLKQFPVGADVPLLILRGGPPSPTKTAKM 


KIAA1634 


10047344 


3 


LYEDKPPLTNTFLISNPRTTADPRILYEDKPPNTKDLDVFLRKQESGFGFRVLGG 
DGPDQSIYIGAIIPLGAAEKDGRLRAADELMCIDGIPVKGKSHKQVLDLMTTAAR 
NGHVLLTVRRKIFYGEKQPEDDSGSPGIHRELT 


KIAA1634 


10047344 


4 


PAPQEPYDNA/LQRKENEGFGFVILTSKNKPPPGVIPHKIGRVIEGSPADRCGKLK 
VGDHISAVNGQSIVELSHDNIVQUKDAGVTVTLTVIAEEEHHGPPS 


KIAA1634 


10047344 


5 


QNLGCYPVELERGPRGFGFSIJRGGKEYNMGIJ=imij^EEX3PAIKCX3RIHVGDQI^ 
INGEPTQGITHTRAIELIQAGGNKVLLLLRPGTGLIPDHGLA 
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KIAA1719 


1 267985 


0 


nVVFL IKKFGSTIGL TISGGTDKDGKPRVSNLnPGG 
HLTRLRHDEIITLLKNVGERWLEVEY 


KIAA1719 


1267985 


1 


ILDVSLYKEGNSFGFVLBGGAHEDGHKSRPLVLTYVRPGGPADREGSLKVGDRL 
LSVDGIPLHGASHATALATLRQCSHEALFQVEYDVATP 


KIAA1719 


1267982 


2 


IHTVANASGPI-MVEIVKTPGSALGISLTTTSLRNKSVITIDRiKPASVVDRSGALH 
PGDHILSIDGTSMEHCSLLEATKLLASISEKVRLEILPVPQSQRPL 


KIAA1719 


1267982 


3 


IQIVi-rrETTEWLCGDPLSGFGLQLQGG 1 FATETLSSPPLVCFI EPDSPAERCGLL 
QVGDRVLSINGIATEDGTMEEANQLLRDAALAHKVVLEVEFDVAESV 


KIAA1719 


1267982 


4 


IQFDVAESVIPSSGTFHVKLPKKRSVELGmSSASRKRGEPLIISDIKKGSVAHRTG 
TLEPGDKLLAIDNIRLDNCPMEDAVQILRQCEDLVKLKIRKDEDN 


KIAA1719 


1 267982 


5 


IQTTGAVSYTVELKRYGGPLGITISGTHEPFDPIVISGLTKRGLAERTGAIHVGDRIL 
AINNVSLKGRPLSEAIHLLQVAGETVTLKIKKQLDR 


KIAA1719 


1267982 


6 


ILEMEELLLPTPLEMHKNm-HKDPMRHDFGFSVSCXBLLEKGVYVHTVRPDGPA 
HRQGLQPFDRVLQVNHVRTRDFDCCLAVPLLAEAGDVLEUISRKPHTAHSS 

1 II IV^Vrf^NMI I fc^l 1 * bXJC V 1 ^1 1 w 1 11 1 Ih^l &^%/Xi#k^^ w I W»*— *^^™»»V»«fc^ V ^^mWIIV^I 11 XI III r^l WJf\^ 


LIM Mystique 


1273425C 




MALTVDVAGPAPWGFRITGGRDFHTPIMVTKVAERGKAKDADLRPGDIIVAING 
ESAEGMLHAEAQSKIRQSPSPLRLQLDRSQATSPGQT 


LIM Protein 


3108092 




SNYSVSLVGPAPWGFRLQGGKDFNMPLTISSLKDGGKAAQANVRIGDWI_SID 
GINAQGMTHLEAQNKIKGCTGSUMIVITLQRAS 

Tl#»XM^i^**l ■ • 1^**— r%XJCI XII X^*<X^ 1 XiM^B^^I ^IVI 1 ^XMI \r^\^ 


LIMK1 


458749e 




TLVEHSKLYCGHCYYQTVWPVIEQILPDSPGSHLPHTVTLVSiPASSHGKRGL 
SVSIDPPHGPPGCGTEHSH7VRVQGVDPGCMSPDVKNSII-IVGDR1I_EINGTPIRN 

Wl V V^lii^l 1 1 l%^l i XJiXw#XMrfl i II V 1 1 V ^C^^ V bri ^■AX^IVi^^l V 1 XI ^V^ll 1 V ^dlL^I II^^hII YVjI II II 11 ^ 

VPLDEIDLLIQETSRLLQLTLEHD 


LIMK2 


1 80559^ 




PYSVTLISMPATTEGRRGFSVSVESACSNYATTVOVKEVNRMHISPNNRNAIHP 
GDRILEINGTPVRTLRVEEVEDAISQTSQTLQLLIEHD 


LliVI-RiL 


1085021 




IHSVTLRGPSPWGFRLVGRDFSAPLTISRVHAGSKASLAALCPGDLIQAINGEST 
ELIVnULEAQNRIKGCHDHLTLSVSRPE 


LU-1 


U521 1 1 




VCYRTDDEEDLGIYVGEVNPNSIAAKDGRIREGDRIIQINGVDVQNREEAVAILS 
QEENTNlSLLVARPESQi-A 


l\/IAGI1 


3370997 




IQKKNIHVVTSRVHECTVKRGPQGELGVTVLGGAEHGEFPYVGAVAAVEAAGLP 
GGGEGPRLGEGEIJXEVCK3VRVSGIJ^RYDVLGVIDSCKEA\m=KAVRC^ 


IWVGI1 


3370997 


2 


PSELKGKFIHTKlJRKSSRGFGFTVVGGDEPDEFLQIKSLVLIDGPAALDGKMErGD 
VIVSVNDTCVLGHTHAOWKIFOSIPIGASVDLELCRGYPLPFDPDDPN 


MAGI1 


3370997 


3 


PATQPEUTVHIVKGPIVIGFGFTIADSPGGGGQRVKQIVDSPRCRGLKEGDLIVEVN 
KKNVOALTHNOVVDMLVECPKGSEVTLLVORGGNLS 


IVIAGI1 


3370997 


4 


PDYQEQDIFLWRKETGFGFRILGGNEPGEPIYIGHIVPLGAADTOGRLRSGDEUCV 

DGTPVIGKSHQLWQLMOOAAKQGHVNLTVRRKWFAVPKTENSS 


IV1AG11 


3370997 


5 


GWSTWQPYDVEiRRGENEGFGFVIVSSVSRPEAGTTFAGNACVAMPHKIGRII 

EGSPADRCXSKLKVGDRIUVVNGCSITNKSHSDIVNLIKEAGNnVTLRIIPGDESSN 

A 


ISAAGll 


3370997 

WW* www* 


6 


QATQEQDFYWEi_ERGAKGFGFSIJ=lGGREYNI\^DLYVIJRUVEDGPAERCGKMRIG 

Vjv^ 1 XJifc iXMI > B V tin %m\m 1 1 IXh^^^XX^I w^I Xi/IhI 1XwmVp«4I IL^ 1 1 ^IVIb'^ 1 V kl IImt^I 1 #Vi^l J^L>i*l IX^ViAlXIVIl li^H<l 

DEILEINGETTKNMKHSRAIELIKNGGRRVRLFLKRG 


MGC5395 


BC012477 


1 


PAKMEKEETTRELLLPNWQGSGSHGLTIAQRDDGVFVQEVTQNSPAARTGW 
KEGDQIVGATIYFDNLQSGEVTQLUm/IGHl-ITVGi-KLHRKGDRSPNSS 


MINT1 


262502^ 

b w^ w w^~ 


1 


SENCKdVFIEKQKGEILGWIVESGWGSIIJ^WIIANMMHGGPAEKSGKLNIGIXDIM 

^^^hI ^V^l XVV V 1 I^hI XXjCI XXib4^m^I^^4 V V 1 V ^w^^X^ V V X^X^ 1 ^1 1 V ll#^l ^IVMVII IXn^Xi^i r^L*^l XX^Xi^l Xkal ^IX^L^^CItVI 

SINGTSLVGLPLSTCQSIIKGLKNQSRVKLNIVRCPPVNSS 


MINT1 


262502^ 

£■ Wb WWb^ 


2 


LRQPPVTTVURRPDLRYQLGFSVQNGIICSLMRGGIAERGGVRVGIHRIIEINGQSV 
VATPHEKIVHILSNAVGEIHMKTMPAAMYRLLNSS 


l\^INT3 


316980C 


i 


LSNSDNCREVHLEKRRGEGLGVALVESGWGSLLPTAVIANLLHGGPAERSGAL 
SIGDRLTAINGTSLVGLPLAAGQAAVRETKSQTSVTLSIVHCPPVTrAIIVI 


MINT3 




2 


LVHCPFVTTAIIHRPHAREOLGFCVEDGIICSLLRGGIAERGGIRVGHRIIEINGOSV 
VATPHARIIELLTEAYGEVHIKTMPAATYRLLTG 


MPP1 


189785 


1 


RKVRLIQFEKVTEEPMGITLKIJNJEKQSCTVARIIJHGGMIHRQGSLHVGDEILEING 
TNVTNHSVDQLQKAMKETKGMISLKVIPNQ 


IVIPP2 


93988^ 


1 


PVPPDAVRMVGIRKTAGEHLGVTFRVEGGELVIARILHGGiyiVAQQGIJJHVGDIlK 
EVNGQPVGSDPRALQELLRNASGSVILKILPNYQ 


IVIUPP1 


210478^ 


1 


CXaRHVEVFELI-KPPSGGLGFSWGLRSENRGELGIFVQEIQEGSVAHRDGRLKET 
DQII_AINGQALDQTITHQQAIS1LQKAKDTVQLVIARGSLPQLV 


IVIUPP1 


210478^ 


2 


PVHWQHMETiELVNDGSGLGFGIIGGKATGVIVKTlLPGGVADQHGRLCSGDHIL 
KIGDTDLAGMSSEQVAQVLRQCGNRVKLMIARGAIEERTAPT 
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MUPP1 


210478^ 


3 


OESPTFDVELTKNVOt^L GrTlAGYIGDKKLPPSGIFWSrTKSSAVPHr^ 
IAVDGTNLCK3RT^J<X)AVEVLRHTGQTVLLTLMRRGMKQEA 


MUPP1 


2104784 


4 


LNYEIWAHVSKFSENSGLGISLEATVGHHFIRSVLPEGPVGHSGKLFSGDELLE 
VNGITL LGFNHOnWIMILKFL PIFVTMVCCRRTVPPT 


MUPP1 


2104784 


5 


WEAGIQHIELEKGSKGLGFSILDYQDPIDPASTVIIIRSLVPGGIAEKDGRLLPGDR 
LMFVNDVNLENSSLEEAVEALKGAPSGTVRIGVAKPLPLSPEE 


MUPP1 


2104784 


6 


RNVSKESFERTINIAKGNSSLGIVrrVSANKDGLGMIVRSIIHGGAISR[X3RIAIGDCIL 
SINFESTISVTNAOARAMLRRHSLIGPDIKITYVPAEHLFP 


MUPP1 


2104784 


7 


L^IWNQPRRVELWREPSKSLGISIVGGRGMGSRLS^K3EVMRGIFIKH^^^ 

KNGTLKPGDRIVFVDGMDLRDASHFOAVFAIRKAGNPWFMVOSIINRPRKSPL 

PSLL 


MUPP1 


2104784 


8 


LTGELHMIFLFKGHSGLGLSLAGNKDRSRMSVFIVGIDPNGAAGKDGRLOIADFL 
LEINGQILYGRSHQNASSIIKCAPSKVKIIFIRNKDAVNQ 


MUPP1 


2104784 


9 


L SSFKNVOHLELPKDOGGL GIAISFFDTLSGVIIKSL TFHGVAATD^ 
LAVDDEIWGYPIEKFISLLKTAKMTVKLTIHAENPDSQ 


MUPP1 


21 04784 


1 0 


L PGCFTTl EISKGRTGLGLSI VGGSDTll GAIIIHFWEFGAACKDGRL WAGDOILE 
VNGIDLRKATHDEAINVLRQTPQRVRLTLYRDEAPYKE 


MUPP1 


21 04784 


"1 -| 


KPPPV/CDTLTIELOKKPGKGLGLSIVGKRNDTGVFVSDIVKGGIADADGRLMOGD 
QILMVNGEDVRNATQEAVAALLKCSLGTVTLEVGRIKAGPFHS 


MUPP1 


2104784 


1 2 


LOGLRTOFMKKGPTDSLGISIAGGVGSPLGDVPiFIAMMHPTGVAAOTOKLRVG 
DRIVTICGTSTEGMTHTQAVNLLKNASGSIEMQWAGGDVSV 


MUPP1 


2104784 


13 


LGPPCXKSm-ERGPDGLGFSIVGGYGSPHGDLPIYVKTWAKGAASETORLKRG 
DOIIAVNGOSLFGVTHEFAVAIL KRTKGTVTLMVLS 

La'v^iirx V ■ ^vji\^oi_ir.\j VII 11 1 rx v^xiu-ixri i ixv^ i v i livi v i 


NeDLG 


1086392C 


1 


IQYEEIVLERGNSGLGFSIAGGIDNPHVPDDPGIFTTKIIPGGAAAMDGRLGVNDCV 
LRVNEVEVSEWHSRAVEALKEAGPWRLVVRRRQN 


iMeULo 




d. 


m 1 \i(CX'^vccx\ i^PQiAri/^ii^Niouipr^riMQiviTVHPi^rs AA/^i^rv^pi oi/^r^ni i a\/mmt 
1 1 LJJ\or^i\oLorolAooloNUnln\j 1 rwilcooMMwrxlJonLUIoUnLJ_>\vrNi 1 

NLQDVRHEEAVASLKNTSDMWLKVAKPGSLE 






o 
o 


II 1 LJI/'/^OTT^I /^CKII\//^^CrV^C^IC\/Cdl A/^i^DAni C/^CI QD^rtOII 0\/M/^V/KII DM 

ATHEQAAAALKRAGQSVTIVAQYRPEEYSRFESKIHDLREQMMNSSMSSGSGS 

L_r 1 1 \j\ rxi 


Neurabin II 


AJ40118S 


1 


CVERLELFPVELEKDSEGLGISIIGMGAGADMGLEKLGIFVKIVrEGGAAHRDGRI 
OVNDI 1 VFVnGT^I VGX/TOQFAAWI RNTKGR\/RFMIGRFRPGFO<5F\/AORIH 
RD 


NOS1 


64252E 


1 


IQPNVISVRLFKRKVGGLGFLVKERVSKPPVIISDLIRGGAAEQSGUQAGDIILAV 
NGRPI VDI ^YD^AI FVI RGIA*?FTH\A/i II RGP 


novel PDZ 


7228177 


1 


QANSDESDIIHSVRVEKSPAGRLGFSVRGGSEHGLGIFVSKVEEGSSAERAGLGV 
GHKITEVNGI <=il F*=5TTMG^A\/KVl T?*S<=5Rl HMMVRRMGRVPGIKFSKEKNJ^ 

vji L^rxj i V 1 ' ' ^ ' 1 1 1 viVJiwrX V iX V i_ 1 ^^wwiilj iiviivi v iiriiviv^ri v i vjiiixi v^ixlixi>i^jw 


novel PDZ 


7228177 


2 


PSDTSSEDGVRRIVHLYTTSDDFCLGFNIRGGKEFGLGIYVSKVDHGGLAEENGIK 
VGnOVI AANGVRFnni^H^OAVFX/l KGOTHIMI TIKFTGRYPAYKEMNSS 


Novel Serine 


1621242 


1 


KIKKFLTESHDRQAKGKAITKKKYIGIRMMSLTSSKAKELKDRHRDFPDVISGAYII 
EVIPDTPAEAGGLKENbVIRINGO'^W'^ANDVSDVfKRESTLNMWRRGNEDI 
MITV 


Niimh RinHinn 

Protein 




\ 


PDGEITSiKINRVDPSESL^^IRLVGGSETPLVHIIIOHlYRDGVIARDGRLLPGDHLK 
VNGMDISNVPHNYAVRLLRQPCQVLWL7VMREQKFRSRNSS 


MiimKi RinHinn 

Protein 




o 

£, 


HRPRDD^FHVII NK^^PFFOI GIKI VRKVr^FPGVFIFMVI DGGVAYRHGOLEEN 
DRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPENSS 


Niimh RinHinn 

Protein 




Q 


PTITCHEKWNIOKDPGF^I GMTVAGGA<^HREWDLPIYVIS\/EPGG\/ISRDGRIK 

1 1 1 1 ^^li^ix V V i^ivxr\Lp/i^ v^^OLV.JiiVi 1 V rXx^xJiAXwrirti^v v l/^i i i v ivj v ^i^ v^Xiji v i«.ji iL^v^riiix 

TGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEFiV 


Kliimh RinHinn 

Protein 




4 


PRPII YISJPKniVI RRNITAG^I GR^IVGGYFFYNGNKPFHKmVFGTPAYNDGRIRCG 
DILLAVNGRSTSGMIHACU^RLLKELKGRITLTIVSWPGTFL 


Outer 
Membrane 


7023825 


1 


LLTEEEINLTRGPSGLGFNIVGGTDQQYVSNDSGIYVSRIKENGAAALDGRLQEG 
DKILSVNGQDLKNLLHQDAVDLFRNAGYAVSLRVQHRLQVQNGIHS 


p55T 


12733367 


1 


PVDAIRILGIHKRAGEPLGVTFRVENNDLVIARILHGGMIDRQGLLHVGDIIKEVN 
GHEVGNNPKELQELLXNISGSVTLKILPSYRDTITPCaQ 


PAR3 


8037914 


1 


DDMVKLVEVPNDGGPLGIHWPFSARGGRTLGLLVKRLEKGGKAEHENLFREN 
DCIVRINDGDLRNRRFEQAQHMFRQAMRTPKWFHWPAA 


PAR3 


8037914 


2 


GKRIJ^JIQLKKGTEGLGFSITSRDVTIGGSAPIYVKNILPRGAAIQDGRLKAGDRLIE 
VNGVDLVGKSQEEWSLLRSTKMEGTVSLLVFRQEDA 
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PAR3 


803791^ 


3 


TPDGTREFLTFEVPLNDSGSAGLGVSVKGNRSKENHADLGIFVKSIINGGAASKD 
GRLRVNDQUAVNGESLLGKTNQDAMETLRRSMSTEGNKRGMIQUVA 


PAR6 


2613011 


1 


LPETHRRVRLHKHGSDRPLGFYIRDGMSVRVAPQGLERVPGIFISRLVRGGLAES 
TGLLAVSDEILEVNGIEVAGKTLDQVTDMMVANSHNLIVTVKPANQR 


PAR6 GAMMA 


1 35371 1€ 


1 


IDVDLVPETHRRVRLHRHGCEKPLGFYIRDGASVRVTPHGLEKVPGIFISRMVPG 
GLAESTGLLAVNDEVLEVNGIEVAGKTLDQVTDMMIANSHNLIVTVKPANQR 
NNVV 


PDZ-73 


5031 97f 




RSRKLKEVRLDRLHPEGLGLSVRGGLEFGCGLFISHLIKGGQADSVGLOVGDEIV 
RINGYSISSCTHEEVINURTKKTVSIKVRHIGLIPVKSSPDEFH 


PD2-73 


503197f 


2 


IPGNRENKEKKVHSLVGSRGLGCSISSGPIQKPGIFISHVKPGSLSAEVGLEIGDOl 
VEVNGVDFSNLDHKEAVNVLKSSRSLTISIVAAAGRELFMTDEF 


PDZ-73 


5031978 


3 


PEQIMGKDVRLLRIKKEGSLDLALEGGVDSPIGfWWSAVYERGAAERHGGIVKG 
DFIMAINGKIVTDYTLAEADAALOKAWIMOGGDWIDLVVAN/CPPKEYDD 


PDZK1 


2944186 


1 


LTSTFNPRECM.SKQEGQNYGFFLR1EKDTEGHLVR\A/EKGSPAEKAGLQDGDR 

VLRINGVFVDKEEHMOWDLVRKSGNSVTLLVLDGDSYEKAGSPGIHRD 


PDZK1 


2944186 


2 


RLCYLVKEGGSYGFSLXTVQGKKGVYMTDITPCXaVAMRAGVLADDHLIEVNGE 
NVEDASHEEWEKVKKSGSRVMFLLVDKETDKREFIVTD 


PDZK1 


2944186 


3 


QFKRETASLKLLPHQPRIVEMKKGSNGYGFYLRAGSEQKGQIIKDIDSGSPAEEAG 
LKNNDLWAVNGESVETLDHDSWEMIRKGGDQTSLLWDKETDNMYRLAEF 
IVTD 


PDZK1 


29441 86 


4 


PDTTFFVDHKPKLCRLAKGENGYGFHLNAIRGL PGSFIKEVOKGGPADLAGL ED 
EDVIIEVNGVNVLDEPYEKWDRIQSSGKNVTLLVZGKNSS 


PICK1 


467841 1 


1 


PTVPGKVTLOKDAONLIGISIGGGAOYCPCLYIVOVFDNTPAALDGTVAAGDEIT 

f 1 V t \^lX V 1 ^^CIX^^^^X«CI V^IXJll^JiV«<IVJlX^I^XVJC 1 V^l 1 1 V \Jt V 1 L#l HIP ^^^X^^^^* 1 V ^^^^VJI^'^bI 1 

GVNGRSIKGKTKVEVAKMIQEVKGEVTIHYNKLQ 


PIST 


9837433C 


1 


SOGVGPIRKVLiiKEDHEGLGISITGGKEHGVPIUSEIHPGOPADRCGGLH^ 
LAVNGVNLRDTKHKEAVTILSQQRGEIEFENAAVAPEVDSD 


prlL16 


1478492 


1 


1 H VTl LHKEEGAGLGFSLAGGADLENKVITVHR VFPNGLASQEGTIQKGNEVLSI 
NGKSLKGTTHHDALAILRQAREPRQAVIVTRKLTPEEFIVTD 


prlL16 


1478492 


2 


TAEATVCTVTLEKMSAGLGFSLEGGKGSLHGDKPLTINRIFKGAASEOSETVOP 
GDEILQLGGTAMQGLTRFEAWNIIKALPDGPVTIVIRRKSLQSK 


PSD95 


331 8652 




LEYEpITLERGNSGLGFSIAGGTDNPHIGDDPSIFITKIIPGGAAAODGRLRVNDSIL 

1 1 1 L^^l 1 L^L-I 1x^1 ^^^\M«4ip,\_JI Vi^l/^ViJV.^ i L^l^l 1 IIV^L^L^I ^^11 1 1 1 XI 1 1 \_<lv^/^r^rXVj<L^X^ 1 1^1 1 V 1 HL^Vp/IIm 

FVNEVDVREVTHSAAVEALKEAGSIVRLYVMRRKPPAENSS 


PSD95 


331 8655 


2 


HVMRRKPPAEKVMEIKUKGPKGLGFSIAGGVGNOHIPGDNSIYVTKIIEGGAAHK 
DGRLQIGDKILAVNSVGLEDVMHEDAVAALKNTYDWYLKVAKPSNAYL 




331 8655 


3 


rediprfprrivihrg^tgi gfnivggerx^egifisfiiaggpadlsgelrkgnoilsv 
NGVDLRNASHEQAAIALKNAGQTVTIIAQYKPEFIVTD 




179912 


■l 


[ IRrrPDFDGKFGFNI KGGVDOKMPLWSRINPESPAnTCIPKLNFGDOIVI INGR 
DISEHTHDQWMFIKASRESHSRELALVIRRR 


PTN-4 


190747 


1 


IRMKPnENGRFGFNVKGGYrK^KMPVIVSRVAPGTPADLCVPRLlVlEG[V5WLINIG 
RDIAEHTHDQWLFIKASCERHSGELMLLVRPNA 


PTPL1 


51 503C 


1 


PEREirLVNLKKDAKYGLGFOIIGGEKMGRLDLGIFISSVAPGGPADFHGCLKPGDR 
LISVNSVSLEGVSHHAAIEILQNAPEDVTLVISQPKEKISKVPSTPVHL 


PTPL1 


51503C 


2 


GDIFEVELAKNDNSLGISVTGGVNTSVRHGGIYVKAVIPOGAAESDGRIHKGDRV 
LAVNGVSLEGATHKQAVETLRNTGQVVHLLLEKGQSPTSK 


PTPL1 


51 503C 


3 


TEENTFEVKL FKNS^GI GF55FSREDNLIPE01NASIVRVKKLFAGOPAAESGKID\/ 
GDVILKVNGASLKGLSQQEVISALRGTAPEVFLLLCRPPPGVLPEIDT 


PTPL1 


51503C 


4 


ELEVELLITLIKSEKASLGFTVTKGNQRIGCYVHDVIQDPAKSDGRLKPGDRLIKV 

V ^.^^1 1 ^11 XVJ1«>»I XA^%JL.\^ 1 IVII XXi^l ^XjCI IIX^V/ IVII V IXjCI-^I /^r N\«#L^X^I 1^1 Xi ^^L^l 1^1 1 X ▼ 

NDTDVTNMTHTDAVNLLRAASKTVRLVIGRVLELPRIPMLPH 


PTPL1 


51 5n3r 


5 


ML PHI L PnrnTCNKFFLGFSL CGGHDSLYOWYISDINPRSVAAIEGNLOLLDV 
IHYVNGVSTQGMTLEEVNRALDMSLPSLVLKATRNDLPV 


RGS12 


3290015 


1 


RPSPPRVRSVEVARGRAGYGFTLSGQAPCVLSGVMRGSPADFVGLRAGDQILA 
VNEINVKKASHEDWKLIGKCSGVLHMVIAEGVGRFESCS 


RGS3 


18644735 


1 


LCSERRYRQITIPRGKDGFGFTICCDSPVRVQAVDSGGPAERAGLQQLDTVLQL 
NERPVEHWKCVEU\HEIRSCPSEIILLVWRMVPQVKPGIHRD 


Rhophilin-like 


14279406 


1 


ISFSANKRWTPPRSIRFTAEEGDLGFTLRGNAPVQVHFLDPYGSASVAGAREGD 
YIVSIQLVDCKWLTLSEVMKLLKSFGEDEIEMKWSLLDSTSSMHNKSAT 


Serine 
Protease 


273891^ 


1 


RGEKKNSSSGISGSQRRYIGVMMLTLSPSILAELQLREPSFPDVQHGVLIHKVILG 
SPAHRAGLRPGDVILAIGEQMVQNAEDVYEAVRTQSQLAVQIRRGRETLTLYV 
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cuank 1 
oiiciiii\ 1 




•| 


FFKTWLOKKDNFGFGFVLRRAKADTPIFFFTPTPAFPALOYLESVDEGGVAW 
QAGLRTGDFLIEVNNENWKVGHRQWNMIRQGGNHLVLKVVTVTRNLDPDD 
TARKKA 


Shank 3 




1 


SDYVIDDKVAVLQKRDHEGFGFVLRGAKAETPIEEFTPTPAFPALQYLESVDVE 

GVAWRAGLRTGDFLIEVNGVNWKVGHKQWALIRQGGNRLVMKWSVTRKP 
EEDG 


Shroom 


18652856 


1 


lYLEAFLEGGAPWGFTLKGGLEHGEPLIISKVEEGGKADTLSSKLQAGDEWHIN 
EVTLSSSRKEAVSLVKGSYKTLRLWRRDVCTDPGH 


SIP1 


2047327 


1 


IRLCRLVRGEQGYGFHLHGEKGRRGQFIRRVEPGSPAEAAALRAGDRLVEVNGV 
NVEGETHHQVVQRIKAVEGQTRLLVVDQN 


SIP1 


2047327 


2 


IRHLRKGPQGYGFNLHSDKSRPGQYIRSVDPGSPAARSGLRAQDRLIEVNGQNV 
EGLRHAEVVASIKAREDEARLLVVDPETDE 


SITAC-18 


8886071 


1 


PGVRE1HLCKDERGKTGLRLRKVDQGLFVQLVQANTPASLVGLRFGDQLLQICX3 
REX^AGWSSHKAHQWKKASGDKIWWRDRPFQRTVTM 


SITAC-18 


8886071 


2 


PFQRTSni^HKDSMGHVGFVIKKGKIVSLVKGSSAARNGLLTNHYVCEVDGQNV 
IGLKDKKIMEILATAGNWTLTIIPSVIYEHIVEFIV 


SSTRIP 


702545C 


1 


LKEKTVLLQKKDSEGFGFVLRGAKAQTPIEEFTPTPAFPALQYLESVDEGGVAW 

RAGLRMGDFUEVNGQNWKVGHRQWNMIRQGGNTLMVKWMVTRHPDMD 

EAVQ 


SYNTENIN 


2795862 


1 


LEIKCX3IREVILCKDQDGKIGLRLKSIDNGIFVQLVQANSPASLVGLRFGDQVLQI 
NGENCAGWSSDKAHKVLKQAFGEKmvlRIHRD 


SYNTENIN 


2795862 


2 


RDRPFERTITTVIHKDSTGHVGFIFKNGKITSIVKDSSAARNGLLTEHNICEINGQNVI 
GLKDSQIADILSTSGNSS 


ftvntrnnhin 1 

w y 1 1 11 III 1 1 

alpha 


1 1 45727 


1 


QRRRVTVRKADAGGLGISIKGGRENKMPILISKIFKGLAADQTEALFVGDAILSVN 
GEDLSSATHDEAVQVLKKTGKEWLEVKYMKDVSPYFK 


SvntroDhin 

yi III III 1 

beta 2 


47670C 


1 


IRWKQEAGGLGISIKGGRENRMPILISKIFPGLAADQSRALRLGDAILSVNGTDLR 
QATHDQAVQALKRAGKEVLLEVKFIREFIVTD 


SvntroDhin 

\mf Villi t 1 ff 1 1 

gamma 1 


95071 65 


1 


EPFYSGERTVTIRRQTVGGFGLSIKGGAEHNIPWVSKISKEQRAELSGLLFIGDAi 
LQINGINVRKCRHEEVVQVLRNAGEEVTLTVSFLKRAPAFLKLP 


Syntrophin 
gamma 2 


950716^ 


1 


SHQGRNRRT\niJ^RQPVGGLGLSIKGGSBHNVPVVISKIFEDQAADQTGMLfVG 
DAVLQVNGIHVENATHEEVVHLLRNAGDEVTITVEYLREAPAFLK 


TAX2-like 
protein 


32531 16 


1 


RGETKEVEVTKTEDALGLTITDNGAGYAFIKRIKEGSIINRIEAVCVGDSIEAINDH 
SIVGCRHYEVAKMLRELPKSQPFTLRLVQPKRAF 


TIAM 1 


450750C 


1 


HSIHIEKSDTAADTYGFSLSSVEEDGIRRLYVNSVKETGLASKKGLKAGDEILEIN 
NRAADALNSSMLKDFLSQPSLGLLVRTYPELE 


TIAM 2 


691270G 


1 


PLNVYDVQLTKTGSVCDFGFAVTAQVDERQHLSRIFISDVLPDGLAYGEGLRKG 
NEIMTLNGEAVSDLDLKQMEALFSEKSVGLTLIARPPDTKATL 


TIP1 


2613001 


1 


QRVEIHKLRQGENLILGFSIGGGITODPSQNPFSEDKTDKGIYVTFIVSEGGPAEIA 
GLQIGDKIMQVNGWDMTMVnTHDQARKRLTKRSEEWRU-VTOQSLQK 


TIP2 


261300C 


1 


RKEVEVFKSEDALGLTITDNGAGYAFIKRIKEGSVIDHIHIJSVGDMIEAINGQSLL 
GCRHYEVARLLKELPRGRTFTIXLTEPRK 


TIP33 


2613007 


1 


HSHPRWELPKTDEGLGFNVMGGKEQNSPIYISRIIPGGVAERHGGLKRGDCM-LS 
VNGVSVEGEHHEKAVELLKAAKDSVKLWRYTPKVL 


TIP43 


261301 1 


1 


ISNQKRGVKVLKQELGGLGISIKQGKENKMPILISKIFKGU^ADQTQALYVGDAILS 
VNGADLRDATHDEAVQALKRAGKEVLLEVKYMREATPYV 


X-11 beta 


300555S 


1 


IHFSNSENCKELQLEKHKGEILGNAAA/ESGWGSILPTVILANMMNGGPAARSGK 
LSIGCXilMSINGTSLVGLPLATCQGIlKGLKNQTQVKLNIVSCPPVTTVLIKRNSS 


X-11 beta 


300555S 


2 


IPPVTWLIKRPDLKYQLGFSVQNGIICSUVIRGGIAERGGVRVGHRIIEINGQSVVA 
TAHEKIVQALSNSVGEIHMKTMPAAMFRLLTGQENSS 


ZO-1 


292937 


1 


IVy^QKTVTUHRAPGFGFGIAISGGRDNPHFQSGETSIVISDVLKGGPAEGQLQEN 
DRVAMVNGVSMDNVEHAFAVQQLRKSGKNAKITIRRKKKVQIPNSS 


ZO-1 


292937 


2 


ISSQPAKPTKVTLVKSRKNEEYGLRUVSHIFVKEISQDSLAARDGNIQEGDWLX 
INGTVTENMSLTDAKTLIERSKGKLKMVVQRDRATLLNSS 


ZO-1 


292937 


3 


IRMKLVKFRKGDSVGLRLAGGNDVGIFVAGVLEDSPAAKEGLEEGDQILRVNNV 
DFTNIIREEAVLFLLDLPKGEEVTILAQKKKDVFSN 


ZO-2 


1273476G 


1 


LIWEQYTVTLQKDSKRGFGIAVSGGRDNPHFENGETSIVISDVLPGGPADGLLQE 
NDRWMVNGTPMEDVLHSFAVQQLRKSGKVAAIWKRPRKV 
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ZO-2 


1273476c 


2 


RVLLMKSRANEEYGLRLGSQIFVKEMTRTGLATKDGNUHEGDIIIJ<INGTVTENM 
SLTDARKLIEKSRGKLQLWLRDS 


ZO-2 


1273476G 


3 


HAPNTKMVRFKKGDSVGLRLAGGNDVGIFVAGIQEGTSAEQEGLQEGDQILKVN 
TQDFRGLVREDAVLYLLEIPKGEMVTILAQSRADVY 


ZO-3 


1009269C 


1 


IPGNSTIWB3HTATLSKDPRRGFGIAISGGRDRPGGSMWSDWPGGPAEGRLQT 
GDHIVMVNGVSMENATSAFAIQILKTCTKMANITVKRPRRIHLPAEFIVTD 


ZO-3 


1009269C 


2 


QDVQMKPVKSVLVKRRDSEEFGVKLGSQIFIKHITDSGLAARHRGLQEGDULQI 
NGVSSQNLSLNDTRRLIEKSEGKLSLLVLRDRGQFLVNIPNSS 


ZO-3 


1009269C 


3 


RGYSPDTRWRFIXGKSIGLRLAGGNDVGIFVSGVQAGSPACX3QGIQEGDQILQV 
NDVPFQNLTREEAVQFLLGLPPGEEMELVTQRKQDIFWKMVQSEFIVTD 


*: No Gl number for this PDZ domain containing protein • it was computer cloned by J.S. using rat Shanl(3 
seq against human genomic clone AC000036. In silico spliced together nt6400-6496, 6985-7109, 7211-7400 

to create hypothetical human Shank3. 
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